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Field of the Invention: 

This invention relates to newly identified polynucleotides, polypeptides encoded by certain of these polynucle- 
otides, molecular complexes of RNAs and polypeptides, the uses of such polynucleotides and polypeptides, as well 
as the production of such polynucleotides and polypeptides and recombinant host cells transformed with the polynu- 
cleotides. The invention relates particularly to such polynucleotides and polypeptides from Staphylococci, especially 
S. aureus. This invention also relates to inhibiting the biosynthesis, assembly or action of such polynucleotides and/ 
or polypeptides and to the use of such inhibitors in therapy 



Background of the invention: 



20 This invention relates to a novel bacterial ribonucleoprotein complex and the component parts thereof. More spe- 

cifically, this invention relates to RNase P, particularly RNase P from Staphylococcus aureus, apd the use of RNase P 
or compg n^ntQ thftr gpf in screens for the identification of antimicrobial compounds a nd to the use of such compounds ^ 
In therapy. 

The Staphylococci make up a medically important genera of microbes. They are known to produce two types of 
25 (jisease. invasive and toxigenic. Invasive infections are characterized generally by abscess fomnation effecting both 
skin surfaces and deep tissues. S. aureus is the second leading cause of bacteremia in cancer patients. Osteomyelitis, 
septic arthritis, septic thrombophlebitis and acute bacterial endocarditis are also relatively common. There are at least 
three clinical conditions resulting from the toxigenic properties of Staphylococci. The manifestation of these diseases 
result from the actions of exotoxins as opposed to tissue invasion and bacteremia. These conditions include; Staphy- 
30 locQCcal food poisoning, scalded skin synuiorne and toxic shock syndrome. 

The frequency of Staphylococcus aureus infections has risen dramatically in the past 20 years. This has been 
attributed to the emergence of multiply antibiotic resistant strains and an increasing population of people with weakened 
immune systems. It is no longer uncommon to isolate Staphylococcus aureus strains which are resistant to some or 
all of the standard antibiotics. This has created a demand for both new anti-microbial agents and diagnostic tests for 
35 this organism. 

While certain Staphylococcal proteins associated with pathogenicity have been identified, e.g., coagulase, hemo- 
lysins, leucocidins and exo- and enterotoxins, additional targets are always useful because it is appreciated that the 
target of a antimicrobial screen can often bias the outcome. Thus, new targets allow for the discovery of new classes 
of antimicrobials. 

40 

Brief Description of the Invention: 

This invention provkies a novel ribonucleoprotein complex, particularly such complex from Staphylococcus aureus, 
and the separately isolated RNA and protein components thereof. 

In accordance with another aspect of the present invention, there are provided polynucleotides (DNA or RNA) 
which encode the protein and RNA components of such a complex. 

In particular the invention provides polynucleotides having the DNA sequences given herein. 

The invention also relates to novel oligonucleotides derived from the sequences given herein which can act, for 
example, asantisense inhibitors of the expression of the RNA or protein components. The oligonucleotides or fragments 
50 or derivatives thereof can also be used to directly inhibit catalytic activity or indirectly inhibit activity by interference 

with RNA protein complex formation. The protein and the RNA components, either separately or in a complex, are also 

useful as targets in screens designed to identify antimicrobial compounds. 

It is an object of the invention to provide polypeptides that have been identified as novel RNaseP polypeptides by 
homology between the amino acid sequence set out in Table 1 [SEQ ID NO: 2] and a known amino acid sequence or 
55 sequences of other proteins, such as S. subtilis RNase P protein. 

It is a further object of the invention to provide polynucleotides that encode RNase P polypeptides, particularly 
polynucleotides that encode the polypeptide herein designated RNaseP, as well as polynucleotides that are transcribed 
into RNase P RNA, particularly catalytic RNA. 
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In a particularly preferred embodiment of the invention the polynucleotide comprises a region encoding RNaseP 
polypeptides comprising the sequence set out in Table 1 [SEQ ID NO:1] which includes a full length gene, or a variant 
thereof. 

In another particularly preferred embodiment of the invention there is a novel RNaseP protein from Staphylococcus 
5 aureus comprising the amino acid sequence of Table 1 .[SEQ ID NO;2]. or a variant thereof. 

In accordance with another aspect of the invention there is provided an isolated nucleic acid molecule encoding 
a mature polypeptide expressible by the Staphylococcus aureus WCUH 29 strain contained in the deposited strain. 

A further aspect of the Invention there are provided isolated nucleic acid molecules encoding RNaseR particularly 
Staphylococcus aureus RNaseP, including mRNAs, cDNAs. genomic DNAs and catalytic RNAs. Further embodiments 
10 of the invention include biologically, diagnostically. prophylactically, clinically or therapeutically useful variants thereof, 
and compositions comprising the same. 

In accordance with another aspect of the Invention, there is provided the use of a polynucleotide of the invention 
for therapeutic or prophylactic purposes, in particular genetic immunization. Among the particularly preferred embod- 
iments of the invention are naturally occurring allelic variants of RNaseP and polypeptides encoded thereby 
IS Another aspect of the invention there are provided novel polypeptides of Staphylococcus aureus referred to herein 

as RNaseP as well as biologically, diagnostically, prophylactically, clinically or therapeutically useful variants thereof, 
and compositions comprising the same. 

Among the particularly preferred embodiments of the invention are variants of RNaseP polypeptide encoded by 
naturally occurring alleles of the RNaseP gene. 
20 In a preferred embodiment of the invention there are provided methods for producing the aforementioned RNaseP 

polypeptides. 

In accordance with yet another aspect of the invention, there are provided inhibitors to such polypeptides, useful 
as antibacterial agents, including, for example, antibodies. 

In accordance with certain preferred eml^odiments of the invention, there are provided products, compositbns and 
2S methods for assessing RNaseP expression, treating disease, for example, disease, such as, infections of the upper 
respiratory tract (e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, 
lung abscess): cardiac (e.g., Infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, splenic abscess, retro- 
peritoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, keratitis, endophthalmitis, 
proGcpta! and orbital ceHuliiis, uarcryocysiitis), Kidney and urinary tract (e.g., epididymitis, intrarenal and perinephric 
30 absces, toxic shock syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, bac- 
terial myositis) bone and joint (e.g., septic arthritis, osteomyelitis), assaying genetic variation, and administering a 
RNaseP polypeptide or polynucleotide to an organism to raise an immunological response against a bacteria, especially 
a Staphylococcus aureus bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the invention there are provided 
35 polynucleotides that hybridize to RNaseP polynucleotide sequences, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies against RNaseP polypeptides. 
In other embodiments of the invention there are provided methods for Identifying compounds which bind to or 
otherwise interact with and inhibit or activate an activity of a polypeptide or polynucleotide of the invention comprising: 
contacting a polypeptide or polynucleotide of the invention with a compound to be screened under conditions to permit 
40 binding to or other interaction between the compound and the polypeptide or polynucleotide to assess the binding to 
or other interaction with the compound, such binding or interaction being associated with a second component capaible 
of providing a detectable signal in response to the binding or interaction of the polypeptide or polynucleotide with the 
compound; and determining whether the compound binds to or otherwise interacts with and activates or inhibits an 
activity of the polypeptide or polynucleotide by detecting the presence or absence of a signal generated from the binding 
or interaction of the compound with the polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided RNaseP agonists and antagonists, 
preferably bacteriostatic or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a RNaseP polynucleotide or a 
RNaseP polypeptide for administration to a cell or to a multicellular organism. 
50 Various changes and modifications within the spirit and scope of the disclosed invention will become readily ap- 

parent-tO'thosesktiled-in-the-art-from-reading-the^GlkDwing-descriptions-and-from-reading-tlie-Gther-^^ 

disclosure. 

Detailed Description of th Figures: 

55 

Figure 1 illustrates the secondary structural modeling of RNase P RNAs based upon published sequence infor- 
mation. 

Figure 2 illustrates the sequence alignment of the Staphylococcal RNase P protein component o1 the invention 



3 



EP0 811 688 A2 



compared to other protein components reported in the literature. 

Figure 3 illustrates the Staphylococcal DNA encoding a region of the RNase P gene [SEQ ID. NO: 14] encoding 
the RNA component used to clone the intact gene compared with the DNA sequence encoding the gene for the RNase 
P RNA component from B. subtilis. 
5 Figure 4 illustrates the predicted secondary structure for the RNA component of the RNase P of this invention 

compared to the RNA component from B. subtilis. 

Figure 5 illustrates the amino acid sequence of the protein component of the S. aureus RNase P and the sequence 
of a DNA encoding same [SEQ ID NOS: 1 and 2]. 

Figure 6 illustrates a schematic of a whole cell rescue assay to identify drug/RNA interaction. 
10 Figure 7 illustrates a schematic of a screen to identity compounds that disrupt RNA/proiein interactions. 

Figure 8 illustrates an example of a minimal RNase P substrate embodiment [SEQ ID NO: 4]. 

Figure 9 shows a schematic diagram of a specific embodiment of the first amplification step of an RNase P gene. 

Figure 10 shows a schematic diagram of a specific embodiment of the second amplificatbn step of an RNase P 
gene. 

15 

Glossary: 

The following definitions are provided to facilitate understanding of certain terms used frequently herein. Certain 
other definitions are provided elsewhere herein. 

20 "Host cell" is a cell which has been transformed or transfected, or is capable of transformation or transfection by 

an exogenous polynucleotide sequence. 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences or two or more poly- 
nucleotide sequences, as determined by comparing the sequences. In the art, "identity" also means the degree of 
sequence relatedness between polypeptide or polynucleotide sequences, as the case may be, as determined by the 

25 match between strings of such sequences. "Identity" and "similarity" can be readily calculated by known methods, 
including but not limited to those described in (Computational Molecular Biology, Lesk, A.M.. ed., Oxford University 
Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New 
York, 1993; Computer Analysis of Sequence Data, Part I, Griffin. A.M.. and Griffin, H.G., eds., Humana Press, New 
Jersey 1 994; Sequence Analysis in Molecular Biology von Heinje, G., Academic Press^ 1 987; and Sequence Analysis 

'^0 Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991 ; and Carlllo, H., and Lipman, D,, SI AM 
J. Applied Math., 48: 1073 (1988). Preferred methods to determine identity are designed to give the largest match 
between the sequences tested. Methods to determine identity and similarity are codified in publicly available computer 
programs. Preferred computer program methods to determine identity and similarity between two sequences include, 
but are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (19S4)), 

35 Bl_ASTP. BLASTN. and FASTA (Atschui, S.F. et aL. J. Molec. Biol. 215: 403-410 (1990). The BLAST X program is 
publicly available from NCBI and other sources (BLAST Manual, Altschul. S., et al., NCBI NLM NIH Bethesda, MD 
20894; Altschul, S., et a!., J. Mol. Biol. 215: 403-410 (1 990). As an Illustration, by a polynucleotide having a nucleotide 
sequence having at least, for example, 95% "identity" to a reference nucleotide sequence of SEQ ID NO: 1 it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide 

40 sequence may induce up to five point mutations per each 100 nucleotides of the reference nucleotide sequence of 
SEQ ID NO: 1. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a 
reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted or substituted 
with another nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference sequence may 
be inserted into the reference sequence. These mutations of the reference sequence may occur at the 5 or 3 terminal 
positions of the reference nucleotide sequence or anywhere between those terminal positrons, interspersed either 
individually among nucleotides in the reference sequence or in one or more contiguous groups within the reference 
sequence. Analogously , by a polypeptide having an amino acid sequence having at least, for example, 95% identity 
to a reference amino acid sequence of SEQ ID NO:2 is intended that the amino acid sequence of the polypeptide Is 
identical to the reference sequence except that the polypeptide sequence may include up to five amino acid alterations 

50 per each 100 amino acids of the reference amino acid of SEQ ID NO: 2. In other words, to obtain a polypeptide having 

an.amino.acid-sequence-atJeast-95% identical to.axeferenceamino.acid-sequence,-up4o-5%.of_the.amino-acidJesi 

in the reference sequence may be deleted or substituted with another amino acid, or a number of amino acids up to 
5% of the total amino acid residues in the reference sequence may be inserted into the reference sequence. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino 

55 acid sequence or anywhere between those terminal positrons, interspersed either individually among residues in the 
reference sequence or in one or more contiguous groups within the reference sequence. 

"Isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs in nature, it has been changed 
or removed from its original environment, or both. For example, a polynucleotide or a polypeptide naturally present in 
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a living organism is not "isolated." but the same polynucleotide or polypeptide separated from the coexisting materials 
of its natural state is 'isolated*', as the term is employed herein. 

''Polynucleotlde(s)* generally refers to any polyribonucleotide or pofydeoxribonucleotide, which may be unmodified 
RNA or DNA or modified RNA or DNA. ''Poiynucleotide(s)" Include, without limitation, single- and double-stranded 

5 DNA; DNA that Is a mixture of single- and double-stranded regions or single-, double- and triple-stranded regions, 
single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules 
comprising DNA and RNA that may be single-stranded or, more typically, double-stranded, or triple-stranded regions, 
or a mixture of single- and double-stranded regions. In addition, "polynucleotide" as used herein refers to triple-stranded 
regions comprising RNA or DNA or both RNA and DNA. The strands in such regions nnay be from the same molecule 

10 or from different molecules. The regions may include all of one or more of the molecules, but more typically involve 
only a region of some of the molecules. One of the molecules of a triple-helical region often is an oligonucleotide. As 
used herein, the term '•polynuclectide(s)' also includes DNAs or RNAs as described above that contain one or more 
modified bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are ''polynucleotide(s) 
" as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified 

15 bases, such as tritylaled bases, to name just two examples, are polynucleotides as the term is used herein. It will be 
appreciated that a great variety of modifications have been made to DNA and RNA that serve many useful purposes 
known to those of skill in the art. The temn "polynucleotidefs)" as it is employed herein embraces such chemically, 
enzymatically or metabollcally modified forms of polynucleotides, as well as the chemical forms of DNA and RNA 
characteristic of viruses and cells, including, for example, simple and complex cells. ''Polynucleotide(s)'' also embraces 

20 short polynucleotides often referred to as oligonucleotide(s). 

''Polypeptide(s)'' refers to any peptide or protein comprising two or more amino acids joined to each other by peptide 
bonds or modified peptide bonds. ''Polyp€ptide(s)*' refers to both short chains, commonly referred to as peptides, oli- 
gopeptides and oligomers and to longer chains generally referred to as proteins. Polypeptides may contain amino acids 
other than the 20 gene encoded amino acids. "Polypeptide(s)" include those modified either by natural processes, 

25 such as processing and other post-translational modifications, but also by chemical modification techniques. Such 
modifications are well described in basic texts and in more detailed monographs, as well as in a voluminous research 
literature, and they are well known to those of skill in the art. It will be appreciated that the same type of modification 
may be present in the same or varying degree at several sites in a given polypeptide. Also, a given polypeptide may 
contain many types of modifications. Modifications can occur anywhere in a polypeptide, including the peptide back- 

$c bone, the amino acid side-chains, and the amino or carboxyl termini. Modifications include, for example, acetylation, 
acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covaient attachment of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent attachment 
of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, formation of covalent cross- 
links, fornnation of cysteine, fomnation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 

55 formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization. glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, 
hydroxylation and ADP-ribosylation, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to pro- 
teins, such as arginylation. and ubiquitination. See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed.. T. E. Creighton, W. H. Freeman and Company, New York (1 993) and Wold, F., Posttranslational 

40 Protein Modifications: Perspectives and Prospects, pgs. 1 -1 2 in POSTTRANSLAUONAL COVALENT MODIFICATION 
OF PROTEINS, B. C. Johnson, Ed., Academic Press, New York (1983); Seifter et al., Meth. EnzymoL 752,626-646 
(1990) and Rattan e! al.. Protein Synthesis: Posttranslational Modifications and Aging, Ann. N Y Acad. Sci. 663: 48-62 
(1992). Polypeptides may be branched or cyclic, with or without branching. Cyclic, branched and branched circular 
polypeptides may result from post-translational natural processes and may be made by entirely synthetic methods, as 

^ well. 

■'Variant(s)" as the term is used herein, is a polynucleotide or polypeptide that differs from a reference polynucleotide 
or polypeptide respectively, but retains essential properties. A typical variant of a polynucleotide differs in nucleotide 
sequence from another reference polynucleotide. Changes in the nucleotide sequence of the variant may or may not 
alter the amino acid sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes may 

so result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the ref- 

er.enc_e_se-quenae^_as_discussed.b.elo_w.^JypicaLvarianl-of^a4^^^ 

reference polypeptide. Generally, differences are limited so that the sequences of the reference polypeptide and the 
variant are closely similar overall and, in many regions, identical. A variant and reference polypeptide may differ in 
amino acid sequence by one or more substitutions, additions, deletions in any combination. A substituted or inserted 

55 amino acid residue may or may not be one encoded by the genetic code, A variant of a polynucleotide or polypeptide 
may be a naturally occurring such as an allelic variant, or it may be a variant that is not known to occur naturally Non- 
naturally occurring variants of polynucleotides and polypeptides may b made by mutagenesis techniques, by direct 
synthesis, and by other recombinant methods known to skilled artisans. 
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Detail d Description of th Invention: 

The ribonucleoprotein. RNase P. plays a key role in the biosynthesis at transfer RNA (tRNA), itself a key interme- 
diate in protein biosynthesis. RNase P functions to process precursor RNAs into mature tRNAs by endoribonucleolytic 

5 action. The complex in prokaryotes is composed of two'subunits: a catalytic RNA and protein co-factor. Recent reviews 
of certain RNase P molecules exist. See for example, LA. Kirsebom, Molecular Microbiology 17(3), 411-420 (1995) 
or N. R. Pace and J. W. Brown, J. Bacteriol . 177 (3), 1919-1928(1995). 

The inventbn relates to novel RNaseP polypeptides and polynucleotides as described in greater detail below. In 
particular, the invention relates to polypeptides and polynucleotides ot a novel RNaseP of Staphylococcus aureus, 

10 which is related by amino acid sequence homology to RNase P polypeptide set forth in Figures 2 and 5 and SEQ ID 
N0:2. The invention relates especially to RNaseP having the nucleotide and amino acid sequences set out in Table 
1, SEQ ID NO: I and SEQ ID NO: 2 respectively, and to the RNaseP nucleotide sequences of the DNA in the deposited 
strain and amino acid sequences encoded thereby. The invention also relates to the RNase P RNA component, par- 
ticularly in its catalytic for, and sequences from which such component is transcribed. 

15 

RNase P RNA Component: 

Phylogenetic comparisons readily allow secondary structure modeling and the identification of a minimum con- 
sensus structure (Figure 1). Data concerning the RNase P RNA structure are available in the RNase P database on 
20 the w.w.w.(http://jwbrown. mbio.ncsu.edu/RNaseP/home.html). A polynucleotide of the invention from which the RNA 
component is transcribed is set forth in Figure 3 [SEQ ID NO: 14]. 

In general, few nucleotides are conserved but compensatory base changes in hydrogen bonded regions show that 
the overall structure is preserved throughout eubacteria. Universal consen/ation of primary sequences (E. co// ; 61 -74, 
353-360) together with other conserved or quasi-consen/ed nucleotides implicate functional importance, the signifi- 
es cance of which remain unknown. To date all RNase P RNA molecules can be folded to fit a consensus 'cage-like' 
structure, beyond this domain there is no convincing structural similarity between prokaryotic and eukaryotic RNase 
P RNAs. 

RN3S6 P Protein Component: 



The precise functional role of the protein remains unknown. While it is appreciated that in Wfro the experimental 
conditions can be established such that the protein component is not necessary lor catalytic activity, in vh/o, the protein 
components appears to be required. A novel RNase P protein component from S. aureus has been identified and is 
characterized by the amino acid sequence given in Figure 2 [SEQ ID NO:2] in which the S. aureus (Sau) sequence is 
35 aligned with sequences from other microorganisms. 

The full length sequence encoding the intact RNase P protein component can be obtained by probing a genomic 
library by for example in situ colony hybridization detailed in Maniatis et al . (infra ) using a probe(s) generated based 
on the sequences given in Figure 5 [SEQ !D NOS: 1 and 2]. 

40 



TABLE 1 

RNaseP Polynucleotide and Polypeptide Sequences 

45 



50 
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(A) Sequences from Staphylococcus aureus RNaseP polynucleotide sequence [SEQ ID 
N0:1]. 

5 

5 ' -ATG TTA TTG GAA AAA GCT TAG CGA ATT AAA AAG AAT GCA GAT 
TTT CAG AGA ATA TAT AAA AAA GGT CAT TCT GTA GCC AAC AGA CAA 
TTT GTT GTA TAG ACT TGT AAT AAT AAA GAA ATA GAC CAT TTT CGC 

10 

TTA GGT ATT AGT GTT TCT AAA AAA CTA GGT AAT GCA GTG TTA AGA 
AAC AAG ATT AAA AGA GCA ATA CGT GAA AAT TTC AAA GTA CAT AAG 
TCG CAT ATA TTG GCC AAA GAT ATT ATT GTA ATA GCA AGA CAG CCA 

15 

GCT AAA GAT ATG ACG ACT TTA CAA ATA CAG AAT AGT CTT GAG CAC 
GTA CTT AAA ATT GCC AAA GTT TTT AAT AAA AAG ATT AAG TAA-3* 

20 
25 
30 
35 
40 
45 
50 
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(B) RNascP polypeptide sequence deduced from the polynucleotide sequence in this 
table [SEQ ID N0:2]. 

MH2- MLLEK VYRIK KNADF GRIYK KGHSV ANRQF WYTC NNKEI DHFRL 
GISVS KKLGN AVLRN KIKRA IRENF KVHKS HILAK DIIVI ARQPA KDMTT 
LQIQN SLEHV LKIAK VFNKK IK-COOH 



(C) Polynucleotide sequence embodiments [SEQ ID NO:l]. 



X-{Rl)n 


-ATG 


TTA 


TTG 


GAA 


AAA 


GCT 


TAC 


CGA 


ATT 


AAA 


AAG 


AAT 


GCA 


GAT 


TTT 


CAG 


AGA 


ATA 


TAT 


AAA 


AAA 


GGT 


CAT 


TCT 


GTA 


GCC 


AAC 


AGA 


CAA 


TTT 


GTT 


GTA 


TAG 


ACT 


TGT 


AAT 


AAT 


AAA 


GAA 


ATA 


GAC 


CAT 


TTT 


CGC 


TTA 


GGT 


ATT 


AGT 


GTT 


TCT 


AAA 


AAA 


CTA 


GGT 


AAT 


GCA 


GTG 


TTA 


AGA 


AAC 


AAG 


ATT 


AAA 


AGA 


GCA 


ATA 


CGT 


GAA 


AAT 


TTC 


AJUV 


GTA 


CAT 


AAG 


TCG 


CAT 


ATA 


TTG 


GCC 


AAA 


GAT 


ATT 


ATT 


GTA 


ATA 


GCA 


AGA 


CAG 


CCA 


GOT 


AAA 


GAT 


ATG 


ACG 


ACT 


TTA 


CAA 


ATA 


CAG 


AAT 


AGT 


CTT 


GAG 


CAC 


GTA 


CTT 


AAA 


ATT 


GCC 


AAA 


GTT 


TTT 


AAT 


AAA 


AAG ATT AAG TAA- 



{R2)n-Y 

(D) Polypeptide sequence embodiments [SEQ ID N0:2]. 

X-( Rx)n-MLLEK VYRIK KNADF GRIYK KGHSV ANRQF WYTC NNKEI DHFRL 
GISVS KKLGN AVLRN KIKRA IRENF KVHKS HILAK DIIVI ARQPA KDMTT 
LQIQN SLEHV LKIAK VFNKK IK-(R2)n"^ 

(E) Sequences from Staphylococcus aurem RNaseP RNA gene [SEQ ID N0:3]. 

5' -GTTCTGATATTTCGGGTAATCGCrATATTATATAGAGGAAAGTCCATGCTCCACAGTCTGAGATGCATT 

GT AGT GTT CGT G CTT GAT G AAA CAAT AAAT CAAGG CATT AATTT G ACGG CAAT G AAAT AT C CT AAGT CT 

TTCGATATGGATAGAGTAATTTGAAAGrGCCACAGrGAOGTAGCTTTTATAGAAATATAAAAGGTGGAA 

CG OSGT AJUCCCCr OG AGTG AG CAAT CCAAATTT GGT AGG AG CACTTGTTT AAOGG A ATT CAA CGT AT 

AAACGAGACACACrr C3 CG AAAT G AAGT GGT GTA 03 A CA.G AT GGT TAT CACCTGAGTACCAGTGTGA 

CT AGTG CACGTG ATG AGT ACG ATGGAACAGAACATGG CTT AT AGAAAT AT OA Cr A CT A 

G-3' 



(F) Polynucleotide sequence embodiments [SEQ ID N0:3]. 

GTT cr G AT ATTT CGGGT AAT CG CT AT ATT AT AT AG AGG AAAGT CCAT G CT CCA CAGT CT G AG AT G CATT 
GT AGT GTT CGT G CTT G ATG AAA CAAT AAAT CAAGG CATT AATTT G AOGG CAATG AAAT AT CCT AAGT CT 
TTOGATATGGATAGAGTAATTTGAAAGTGCCACAGTGAOGTAGCTTTTATAGAAATATAAAAGGTGGAA 
OG OGGT AAACCCCr 03 AGTG AG CAAT CCAAATTTGGT AGG AG CACTTGTTT AAOGGAATT CAA CGT AT 



8 



EP0 811 688 A2 



AAACGAGACACACTTCGCGAAATGAAGTGGTGTACGACAGATGGTTATCACCrGAGTACCAGTGTGA 
CTAGTGCXOGTGATGAGTACGATGGAACAGAACATGGCrTATAGAAATATCACrACTA 

^ G-(R2)n"Y 

Polvpeotides of the invention: 

The polypeptides of the invention include the polypeptide of Table 1 [SEQ ID NO:2] (in particular the mature 

10 polypeptide) as well as polypeptides and fragments, particularly those which have the biological activity of RNaseR 
and also those which have at least 70% identity to the polypeptide ot Table 1 [SEQ ID NO:21 or the relevant portion, 
preferably at least 80% identity to the polypeptide of Table 1 [SEQ ID NO:2]. and more preferably at least 90% similarity 
(more preferably at least 90% identity) to the polypeptide of Table 1 [SEQ ID NO:2] and still more preferably at least 
95% similarity (still more preferably at least 95% identity) to the polypeptide of Table 1 [SEQ ID NO: 2] and also include 

15 portions of such polypeptides with such portion of the polypeptide generally containing at least 30 amino acids and 
more preferably at least 50 amino acids. 

The invention also includes polypeptides of the formula set forth in Table 1 (D) [SEQ I D NO:2] wherein, at the amino 
terminus, X is hydrogen, and at the carboxyl terminus, Y is hydrogen or a metal, and is any amino acid residue, 
and n is an integer between 1 and 1000. Any stretch of amino acid residues denoted by either R group, where R is 

20 greater than 1 , may be either a heteropotymer or a homopolymer. preferably a heteropolymer. 

A fragment is a variant polypeptide having an amino acid sequence that entirely is the same as part but not all of 
the amino acid sequence of the aforementioned polypeptides. As with RNaseP polypeptides fragments may be Iree- 
standing," or comprised within a larger polypeptide of which they form a part or region, most preferably as a single 
continuous region, a single larger polypeptide. 

25 Preferred fragments include, for example, truncation polypeptides having a portion of the amino acid sequence of 

Table 1 [SEQ ID N0:2]. or of variants thereof, such as a continuous series of residues that includes the amino terminus, 
or a continuous series of residues that includes the carboxyl terminus. Degradation forms of the polypeptides of the 
invention in a host ceil, particularly a Staphylococcus aureus, are also preferred. Further preferred are fragments char- 
acierized by structural or funciionai annbutes such as fragments that comprise alpha-helix and alpha-helix forming 

30 regions, beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil-forming regions, hy- 
drophilic regions, hydrophobic regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 
forming regions, substrate binding regbn, and high antigenic index regions. 

Also preferred are biologically active fragments which are those fragments that mediate activities of RNaseR in- 
cluding those with a similar activity or an improved activity, or with a decreased undesirable activity. Also included are 

25 those fragments that are antigenic or immunogenic in an animal, especially in a human. Particularly preferred are 
fragments comprising receptors or domains ot enzymes that confer a function essential for viability of Staphylococcus 
aureus or the ability to initiate, or maintain cause disease in an individual, particularly a human. 

Variants that are fragments of the polypeptides of the Invention may be employed for producing the corresponding 
full-length polypeptide by peptide synthesis; therefore, these variants may be employed as intermediates for producing 

40 the full-length polypeptides of the invention. 

Polynucleotides of the invention: 

Another aspect of the invention relates to isolated polynucleotides, including the full length gene, that encode the 
45 RNaseP polypeptide having a deduced amino acid sequence of Table 1 [SEQ ID NO:2] and polynucleotides closely 

related thereto and variants thereof. 

Using the Information provided herein, such as a polynucleotide sequence set out in SEQ ID NOS: 1 . 3, 4 and 14. 

a polynucleotide of the invention encoding RNaseP polypeptide or RNA (such as that transcribed from SEQ ID NO:3) 

may be obtained using standard cloning and screening methods, such as those for cloning and sequencing chromo- 
50 somal DMA fragments from bacteria using Staphylococcus aureus WCUH 29 cells as starting material, followed by 
obtaining aiu1iiength-ctonerFor-exampferto-obtairha-polynucleotide-sequenee-oMhe-4nv«ntionr^uGh-as^^ 

given in SEQ ID NOS:T .3.4 and 14, typically a library of clones of chromosomal DNAof Staphylococcus aureus \NC\JH 

29 in E.coli or some other suitable host is probed with a radiolabeled oligonucleotide, preferably a 17-mer or longer. 

derived from a partial sequence. Clones carrying DNA identical to that of the probe can then be distinguished using 
ss stringent conditions. By sequencing the individual clones thus identified with sequencing primers designed from the 

original sequence it Is then possible to extend the sequence in both directions to determine the full gene sequence. 

Conveniently, such sequencing is performed using denatured double stranded DNA prepared from a plasmid clone. 

Suitable techniques are described by Maniatis, T, Fritsch. E.F. and Sambrook et aL, MOLECULAR CLONING, A U^B- 
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ORATORY MANUAL. 2nd Ed.: Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989). (see in 
particular Screening By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13.70). Illus- 
trative of the invention, the polynucleotides set out in Table I [SEQ ID NO;1 . 4 and 5] were discovered in a DNA library 
derived from Staphylococcus aureus WCUH 29. 

5 Certain DNA secuences set out in Table 1 [SEQ ID N0:1] contains an open reading frame encoding a protein 

having about the number of amino acid residues set forth in Table 1 [SEQ ID NO:2] with a deduced molecular weight 
that can be calculated using amino acid residue molecular weight values welt known in the art. The polynucleotide of 
SEQ ID NO: 1, between nucleotide number 1 through number 351 encodes the polypeptide of SEQ ID N0:2. The stop 
codon begins at nucleotide number 352 of SEQ ID NO: 1 . 

10 RNaseP of the invention is structurally related to other proteins of the RNase P family, as shown by the results of 

sequencing the DNA encoding RNaseP of the deposited strain. The protein exhibits greatest homology to B. subtilis 
protein among known proteins. RNaseP of Table i [SEQ ID N0:2] has signficant identity and similarity over its entire 
length with the amino acid sequence of subtilis RNase P polypeptide. 

The invention provides a polynucleotide sequence identical over its entire length to the coding sequence in Table 

is I [SEQ ID NO:1]. Also provided by the invention is the coding sequence for the mature polypeptide or a fragment 
thereof, by itself as well as the coding sequence for the mature polypeptide or a fragment in reading frame with other 
coding sequence, such as those encoding a leader or secretory sequence, a pre-, or pro- or prepro-protein sequence. 
The polynucleotide may also contain non-coding sequences, including for example, but not limited to non-coding 5' 
and 3' sequences, such as the transcribed, non-translated sequences, termination signals, ribosome binding sites, 

20 sequences that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence which encode ad- 
ditional amino acids. For example, a marker sequence that facilitates purification of the fused polypeptide can be 
encoded. In certain embodiments of the invention, the marker sequence is a hexa-histidine peptide, as provided in the 
pQE vector (Qiagen, Inc.) and described in Gentz et ai, Proa Natl Acad Set,, USA 86: 821-824 (1 989), or an HA tag 
(Wilson et ai, Cell 37: 767 (1 984). Polynucleotides of the invention also include, but are not limited to, polynucleotides 

25 comprising a structural gene and its naturally associated sequences that control gene expression. 

A preferred embodiment of the invention is a polynucleotide of comprising nucleotide 1 to 351 or 354 set forth in 
SEQ ID NO: 1 of Table 1 which encode the RNaseP polypeptide. 

The invention also includes polynucleotides of the formula set forth in Table 1 (C)[SEQ ID N0:1 and (F)[SEQ ID 
NO:31 wherein, at the 5' end of the molecule, X is hydrogen, and at the 3' sr.d of ths mCleculs, Y is uyuroyon or a rnelai, 

30 R^ and is any nucleic acid residue, and n is an integer between 1 and 3000. Any stretch of nucleic acid residues 
denoted by either R group, where R is greater than 1 , may be either a heteropolymer or a homopolymer, preferably a 
heteropolymer. A preferred embodiment for the sequence set forth in Table 1(F) [SEQ ID NO:3] has R^ or Rg being 
between 1 and 10 or 1 and 20, and especially being 1, 2, 3, 4, 5. 6, 7, 8, 9 or 10. The invention also provides RNA 
transcribed from such polynucleotides, particularly catalytic RNAs. 

35 The term "polynucleotide encoding a polypeptide" as used herein encompasses polynucleotides that include a 

sequence encoding a polypeptide of the invention, particularly a bacterial polypeptide and more particularly a polypep- 
tide of the Staphylococcus aureus RNaseP having the amino acid sequence set out in Table 1 [SEQ ID NO:2]. The 
term also encompasses polynucleotides that include a single continuous region or discontinuous regions encoding the 
polypeptide (for example, interrupted by integrated phage or an insertion sequence or editing) together with additional 

40 regions, that also may contain coding and/or non-coding sequences. 

The invention further relates to variants of the polynucleotides described herein that encode for variants of the 
polypeptide having the deduced amino acid sequence of Table 1 [SEQ ID NO:2]. Variants that are fragments of the 
polynucleotides of the invention may be used to synthesize full-length polynucleotides of the invention. 

In addition to the standard A, G, C, T/U representations for nucleic acid bases, the term "N' is also used. "N" means 

45 that any of the four DNA or RNA bases may appear at such a designated position in the DNA or RNA sequence, except 
that, in preferred embodiments, N can not be a base that when taken in combination with adjacent nucleotide positions, 
when read in the correct reading frame, would have the effect of generating a premature termination codon in such 
reading frame. 

Further particularly preferred embodiments are polynucleotides encoding RNaseP variants, that have the amino 
50 acid sequence of RNaseP polypeptide of Table 1 [SEQ ID N0:2] in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or 

no'amin'o-acid-restdues^aresubstitutedrdeleted-oraddednn-anyxombinationr-Espeeia 

silent substitutions, additions and deletions, that do not alter the properties and activities of RNaseP 

Further preferred embodiments of the invention are polynucleotides that are at least 50%. 60% or 70% identical 
over their entire length to a polynucleotide encoding RNaseP polypeptide having an amino acid sequence set out in 
55 Table 1 [SEQ ID NO:2], and polynucleotides that are complementary to such polynucleotides. Alternatively, most highly 
preferred are polynucleotides that comprise a region that is at least 60% identical over its entire length to a polynucle- 
otide encoding RNaseP polypeptide of the deposited strain and polynucleotides complementary thereto. In this regard, 
polynucleotides at least 90% identical over their entire length to the same are particularly preferred, and among these 
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particularly preferred polynucleotides, those with at least 95% are especially preferred. Furthermore, those with at least 
97% are highly preferred among those with at least 95%, and among these those with at least 98% and at least 99% 
are particularly highly preferred, with at least 99% being the more preferred. 

Preferred embodiments of the invention are polynucleotides that are at least 50%. 60% or 70% identical over their 
5 entire length to an RNaseP polynucleotide having a nucleotide sequence set out in SEQ ID NO;3, 4 or 14, and poly- 
nucleotides that are complementary to such polynucleotides. Alternatively, most highly preferred are polynucleotides 
that comprise a region that is at least 80% identical over its entire length to an RNaseP polynucleotide of the deposited 
strain and polynucleotides complementary thereto. In this regard, polynucleotides at least 90% identical over their 
entire length to the same are particularly preferred, and among these particularly preferred polynucleotides, those with 
TO at least 95% are especially preferred. Furthermore, those with at least 97%j are highly preferred among those with at 
least 95%, and among these those with at least 98% and at least 99% are particularly highly preferred, with at least 
99% being the more preferred. It is especially preferred that these polynucleotides be RNAs, especially catalytic RN As. 

Preferred embodiments are polynucleotides that encode polypeptides that retain substantially the same biological 
function or activity as the mature polypeptide encoded by the DNA of Table 1 [SEQ ID NO: 1] or as the RNase P RNA 
'5 component transcribed by the DNA of SEQ ID NO:3, 4 or 14. 

The invention further relates to polynucleotides that hybridize to the herein above-described sequences. In this 
regard, the invention especially relates to polynucleotides that hybridize under stringent conditions to the herein above- 
described polynucleotides. As herein used, the terms "stringent conditions" and "stringent hybridization conditions" 
mean hybridization will occur only if there is at least 95% and preferably at least 97% identity between the sequences. 
20 An example of stringent hybridization conditions is overnight incubation at 42°C in a solution comprising: 50% forma- 
mide, 5x SSC (ISOmM NaCL 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed by washing the hybridization 
support in 0.1 X SSC at about 65*0. Hybridization and wash conditions are well known and exemplified in Sambrook, 
etal., fy/lolecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (19S9), particularly Chapter 
25 11 therein. 

The invention also provides a polynucleotide consisting essentially of a polynucleotide sequence obtainable by 
screening an appropriate library containing the complete gene for a polynucleotide sequence set forth in SEQ ID NO: 
1 or SEQ ID NO:3 or SEQ ID N0:4 or SEQ ID NO:l4 under stringent hybridization conditions with a probe having the 
sequence of said polynucleotide sequence set forth in SEQ ID NO: 1 or SEQ !D NO:3 or SEQ ID NO:4 or SEQ iD NO: 

30 14 respectively or a fragment thereof; and isolating said DNA sequence. Fragments useful for obtaining such a poly- 
nucleotide include, for example, probes and primers described elsewhere herein. 

As discussed herein regarding polynucleotide assays of the invention, for instance, polynucleotides of the invention 
as discussed above, may be used as a hybridization probe for RNA, cDNA and genomic DNA to isolate full-length 
cDNAs and genomic clones encoding RNaseP and to isolate cDNA and genomic clones of other genes that have a 

35 high sequence similarity to the RNaseP gene. Such probes generally will comprise at least 15 bases. Preferably such 
probes will have at least 30 bases and may have at least 50 bases. Particularly preferred probes will have at least 30 
bases and will have 50 bases or less. 

Polynucleotides of the invention that are oligonucleotides derived from the sequences of SEQ ID NOS:1 and/or 2 
and/or 3 and/or 4 and/or 14 may be used in the processes herein as described, but preferably for PCR, to determine 

40 whether or not the polynucleotides identified herein in whole or in part are transcribed in bacteria in infected tissue. It 
is recognized that such sequences will also have utility in diagnosis of the stage of infection and type of infection the 
pathogen has attained. 

The invention also provides polynucleotides that may encode a polypeptide that is the mature protein plus additional 
amino or carboxyl -terminal amino acids, or amino acids interior to the mature polypeptide (when the mature form has 
45 mere than one polypeptide chain, for instance). Such sequences may play a role in processing of a protein from pre- 
cursor to a mature form, may allow protein transport, may lengthen or shorten protein half -life or may facilitate manip- 
ulatbn of a protein tor assay or production, among other things. As generally is the case in vivo, the additional amino 
acids may be processed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more prosequences may be an 
50 inactive form of the polypeptide. When prosequences are removed such inactive precursors generally are activated. 

Someonall of4he-prosequences.may-be-removedJDefore3ctivation._Generally,-Such.precursors.arexalled„proproteins._ 

In sum, a polynucleotide of the invention may encode a mature protein, a mature protein plus a leader sequence 
(which may be referred to as a preprotein), a precursor of a mature protein having one or more prosequences that are 
not the leader sequences of a preprotein, or a preproprotein. which is a precursor to a proprotein, having a leader 
55 sequence and one or more prosequences. which generally are removed during processing steps that produce active 
and mature forms of the polypeptide. 
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Cloning of S. aureus RNase P RNA structural gene: 

A partial homoiog to B. subtilis RNase P RNA was identified in a proprietary S. aureus database comprising se- 
quences fronn randonniy sequenced S. aureus DNA library in an E. co// host. This homoiog (A33202: B. subtilis RNA 

5 nucleotides 311-^60) is shown in Figure 3 [SEQ ID NCXU]. 

A PGR primer based on these data was designed to the 3' end of the gene (primer: 5*-CGC GAA GTG TGT CTC 
GTT TAT ACG-3') [SEQ ID N0:5] and a second based on a universally conserved sequence within the 5' domain (5'- 
GAG GAA AGT CCA TGC TC-3') [SEQ ID NO:6] ( available from the RNase P database w.w.w.) permitted recovery 
of approximately 90% of the gene. The complete structural gene was amplified using a degenerate primer (5'-^/jGA- 

10 TATTTCQ/tGA/qTAA'''/cC-3') [SEQ ID NO: 13) that would allow the RNA product to form the predicted helices PI and 
P2 (Figure 1 ). The structural gene has been cloned behind a T7 promoter, sequenced and shown to be highly related 
to B. subtilis homoiog (Figure 2). The precise S. aureus genomic sequence encoding (transcribing) helices PI and P2 
may be determined by skilled artisans using methods and compounds of the invention^ such as the degenerate primer 
described above [SEQ ID NO: 1 3). 

15 The predicted secondary structure is shown in Figure 4 and based on data available for B. subtilis, reveals a limited 

number of compensatory base changes within helices (e.g. PI, P2, P8, Pi 2), an apparent shortening of helices P3 
and P9 and. an extension of helix P18. 

Further Description and Definitions: 

20 

The coding region of the RNaseP gene may be isolated, tor example, by screening using a deposit containing a 
Staphylococcus aureus WCUH 29 strain which has been deposited with the National Collections of Industrial and 
Marine Bacteria Ltd. (herein "NCIMB"), 23 St Machar Drive. Aberdeen AB2 IRY, Scotland on 11 September 1995 and 
assigned NCIMB Deposit No. 40771 It was referred to as Staphylococcus aureus WCUH29 on deposit. The Staphy- 
25 lococcus aureus strain deposit is referred to herein as "the deposited strain" or as "the DNA of the deposited strain." 

The deposited strain contains the full length RNaseP gene. The sequence of the polynucleotides contained in the 
deposited strain, as well as the amino acid sequence of the polypeptide encoded thereby, are controlling in the event 
of any conflict with any description of sequences herein. 

The deposit ot the deposited sUain has been made under the terms ot the Budapest Treaty on the International 
30 Recognition of the Deposit of Micro-organisms for Purposes of Patent Procedure. The strain will be irrevocably and 
without restriction or condition released to the public upon the issuance of a patent. The deposited strain is provided 
merely as convenience to those of skill in the art and is not an admission that a deposit is required for enablement, 
such as that required under 35 U.S.C. §112. 

A license may be required to make, use or sell the deposited strain, and compounds derived therefrom, and no 
35 such license is hereby granted. 

The nucleotide sequences disclosed herein can also be obtained by synthetic chemical techniques known in the 
art or can be obtained from S. aureus WCUH 29 by probing a DNA preparation wrth probes constructed from the 
particular sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed sequence can act as 
PCR primers in a process of PCR-based cloning of the sequence from a bacterial genomic source. It is recognized 
40 that such sequences will also. have utility in diagnosis of the type of infection the pathogen has attained. 

A polynucleotide of the present invention may be in the form of RNA or in the form ot DNA, which DNA includes 
cDNA, genomic DNA, and synthetic DNA The DNA may be double-stranded or single-stranded, and if single stranded 
may be the coding strand or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide may 
be identical to the ceding sequence shown or may be a different coding sequence which coding sequence, as a result 
45 of the redundancy or degeneracy of the genetic code, encoding the same polypeptide. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a polynucleotide which includes only coding 
sequence for the polypeptide as well as a polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding sequence for the mature polypeptide 
may be fused in the same reading frame to a polynucleotide sequence which aids in expression and secretion of a 
50 polypeptide from a host cell, for example, a leader sequence which functions as a secretory sequence for controlling 

transport-of-a-polypeptide-frcffTi-the-oellrT^e-polypeptide^naving-a^eader-sequeneeH 

leader sequence cleaved by the host cell to fomn the mature form of the polypeptide. The polynucleotides may also 
encode for a proprotein which is the mature protein plus additional 5' amino acid residues. A mature protein having a 
prosequence is a proprotein and is an inactive form of the protein. Once the prosequence is cleaved an active mature 
55 protein remains. 

Thus, for example, the polynucleotide of the present invention may encode for a mature protein, or for a protein 
having a prosequence or for a protein having both a prosequence and a presequence (leader sequence). Further, the 
amino acid sequences provided herein show a methionine residue at the NH2-terminus. It is appreciated, however, 
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that during post-translational modification of the peptide, this residue may be deleted. Accordingly, this invention con- 
templates the use of both the methionine-containing and the methionineless amino terminal variants of each protein 
disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence fused in frame to a marker 
5 sequence at either the 5' or 3' terminus of the gene which allows for purification of the polypeptide of the present 
invention. The marker sequence may be a hexa-histldine tag supplied by the pQE series of vectors (supplied commer- 
cially by Quiagen Inc.) to provide for purification of the polypeptide fused to the marker in the case of a bacterial host. 
Alternatively the maltose binding protein (MBP) fusion system may be employed. In this system the gene of interest 
is fused the malE gene encoding the MBP (supplied by New England BloLabs). The fusion product is purified in a one 
10 step procedure based on the MBP affinity for maltose. A pre-engineered Xa cleavage site allows for efficient removal 
of the MBP component from the gene product of interest. 

In order to facilitate understanding of the following example certain frequently occurring methods and/or terms will 
be described. 

Tlasmids" are designated by a lower case p preceded and/or followed by capital letters and/or numbers. The 
'5 starting plasmids herein are either commercially available, publicly available on an unrestricted basis, or can be con- 
structed from available plasmids in accord with published procedures. In addition, equivalent plasmids to those de- 
scribed are known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme that acts only at certain 
sequences in the DNA. The various restriction enzymes used herein are commercially available and their reaction 
20 conditions, cofactors and other requirements were used as would be known to the ordinarily skilled artisan. For ana- 
lytical purposes, typically 1 ng of plasmid or DNA fragment is used with about 2 units of enzyme in about 20jil of buffer 
solution. For the purpose of isolating DNA fragments for plasmid construction, typically 5 to 50 ug of DNA are digested 
with 20 to 250 units of enzyme in a larger volume. Appropriate buffers and substrate amounts for particular restriction 
enzymes are specified by the manufacturer Incubation times of about 1 hour at 37**C are ordinarily used, but may vary 
25 In accordance with the supplier's instructions. After digestion the reaction is electrophoresed directly on an agarose 
gel to isolate the desired fragment. Size separation of the cleaved fragments is generally performed using a 1 % percent 
agarose gel. 

"Oligonucleotides' refers to either a single stranded polydeoxy nucleotide or two complementary potydeoxynucle- 
otide strands which may be chemically synthesized. Such synthetic oligonucleotides have no 5' phosphate and thus 
will not ligatetoanotheroligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A synthetic 
oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two double stranded nucleic acid frag- 
ments (Maniatis, T., et aL, supra. , p. 146). Unless otherwise provided, ligation may be accomplished using known 
buffers and conditions with 10 units to T4 DNA ligase ("tigase") per 0.5 \xq of approximately equimolar amounts of the 
55 DNA fragments to be ligated. 

The polypeptides and polynucleotides of the present invention are preferably provided in an isolated form, and 
preferably are purified to homogeneity. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that functions as an autonomous unit of 
DNA replication in vivo; i.e., capable of replication under its own control. 
40 A Vector" is a replicon, such as a plasmid, phage, or cosmid, to which another DNA segment may be attached so 

as to bring about the replication of the attached segment. 

A "double-stranded DNA molecule* refers to the polymeric form of deoxy ribonucleotides (bases adenine, guanine, 
thymine, or cytosine) in a double-stranded helix, both relaxed and supercoiled. This term refers only to the primary 
and secondary structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this term includes 
45 double-stranded DNA found, inter alia, in linear DNA molecules (e.g., restriction fragments), viruses, plasmids, and 
chromosomes. In discussing the structure of particular double-stranded DNA molecules, sequences may be described 
herein according to the normal convention of giving only the sequence in the 5' to 3' direction along the nontranscribed 
strand of DNA (i.e., the strand having the sequence homologous to the mRNA). 

A DNA 'coding sequence of or a "nucleotide sequence encoding" a particular protein, is a DNA sequence which 
50 Is transcribed and translated into a polypeptide when placed under the control of appropriate regulatory sequences. 

A— pr^moter-sequenGe~is-a-DNA-r«gufatory"regiGn-capable-of-bindirvg~RNA-polymerase4r^-a-cel^ 

transcription of a downstream (3' direction) coding sequence. For purposes of defining the present invention, the pro- 
moter sequence is bound at the 3' terminus by a translation start codon {e.g.. ATG) of a coding sequence and extends 
upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels 
5S detectable above background. Within the promoter sequence will be found a transcription initiation site (conveniently 
defined by mapping with nuclease SI ), as well as protein binding domains (consensus sequences) responsible for the 
binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes and "CAT" boxes. 
Prokaryotic promoters contain the -10 and -35 consensus sequences. 
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DNA "control sequences' refers collectively to pronnoter sequences, ribosonne binding sites, polyadenylation sig- 
nals, transcription termination sequences, upstreann regulatory domains, enhancers, and the like, which collectively 
provide for the expression (i.e.. the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when RNA polymerase will bind the 
5 promoter sequence and transcribe the coding sequence into mRNA, which is then translated into the polypeptide 
encoded by the coding sequence. 

A "host ceir is a cell which has been transformed or transfected, or is capable of transformation or transfection by 
an exogenous DNA sequence. 

A celt has been 'transformed" by exogenous DNA when such exogenous DNA has been introduced inside the cell 
10 membrane. Exogenous DNA may or may not be integrated (covalently linked) into chromosomal DNA making up the 
genome of the cell. In prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an episomal 
element, such as a plasmid. With respect to eukaryotic cells, a stably transformed or transfected cell is one in which 
the exogenous DNA has become integrated into the chromosome so that it is inherited by daughter cells through 
chromosome replication. This stability is demonstrated by the ability of the eukaryotic cell lo establish cell lines or 
?5 clones comprised of a population of daughter cell containing the exogenous DNA. 

A "clone" is a population of cells derived from a single cell or common ancestor by mitosis. A "cell line" is a ctone 
of a primary celt that is capable of stable growth in vitro for many generations. 

A "heterologous* region of a DNA construct is an identifiable segment of DNA within or attached to another DNA 
molecule that is not found in association with the other molecule in nature. 

20 

Preparation of the RNase P protein component: 

The present invention also relates to vectors which include polynucleotides of the present invention, host cells 
which are genetically engineered with vectors of the invention and the production of polypeptides of the invention by 

25 recombinant techniques. 

In accordance with yet a further aspect of the present invention, there is therefore provided a process for producing 
the polypeptide of the invention by recombinant techniques by expressing a polynucleotide encoding said polypeptide 
in a host and recovering the expressed product. Alternatively the polypeptides of the invention can be synthetically 
produced by conventional peptide synthesizers. 

30 Host cells are genetically engineered (transduced or transformed or transfected) with the vectors of this invention 

which may be. for example, a cloning vector or an expression vector. The vector may be, for example, in the form of 
a plasmid, a cosmid, a phage, etc. The engineered host cells can be cultured in conventional nutrient media modified 
as appropriate for activating promoters, selecting transformants or amplifying the genes. The culture conditions, such 
as temperature, pH and the like, are those previously used with the host cell selected for expression, and will be 

55 apparent to the ordinarily skilled artisan. 

Suitable expression vectors include chromosomal, nonchromosomaland synthetic DNA sequences, e.g., bacterial 
plasmids; phage DNA: baculovirus; yeast plasmids; vectors derived from combinations of plasmids and phage DNA. 
However, any other vector may be used as long as it is replicable and viable in the host. 

For recombinant production, host cells can be genetically engineered to incorporate expression systems or portions 

40 thereof or polynucleotides of the invention. Introduction of a polynucleotide into the host eel! can be effected by methods 
described in many standard laboratory manuals, such as Davis et al„ BASIC METHODS IN MOLECULAR BIOLOGY, 
(1986) and Sambrcok et al.. MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Lab- 
oratory Press, Cold Spring Harbor, N.Y (1989), such as, calcium phosphate transfection, DEAE-dextran mediated 
transfection, transvection, microinjection, cationic lipid-mediated transfection, electroporation. transduction, scrape 

•^5 loading, ballistic introduction and infection. 

Representative examples of appropriate hosts include bacterial cells, such as streptococci, staphylococci, ente- 
rococci E. coli, streptomyces and Bacillus subtilis cells; fungal cells, such as yeast cells and Aspergillus cells: insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO. COS. HeLa, C127. 3T3. BHK. 293 
and Bowes melanoma cells; and plant cells. 

50 A great variety of expression systems can be used to produce the polypeptides of the invention. Such vectors 

-includeramon got he rsrchromosomalreptsonial and virus<le rived vectorsra-g,Tvectorsi^ 
from bacteriophage, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal ele- 
ments, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox 
viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof such as those derived 

55 from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. Th expression system constructs 
may contain control regions that regulate as well as engender expression. Generally any system or vector suitable to 
maintain, propagate or express polynucleotides and/or to express a polypeptid in a host may be used for expression 
in this regard. The appropriate DNA sequence may be inserted into the expression system by any of a variety of well- 
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known and routine techniques, such as. for example, those set forth in Sambrook et ai, MOLECULAR CLONING, A 
LABORATORY MANUAL (supra). 

The appropriate DNA sequence may be inserted into the vector by a variety of procedures. In general, the DNA 
sequence is inserted into an appropriate restriction endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate expression control sequence 
(s) (promoter) to direct mRNA synthesis. As representative examples of such promoters, there may be mentioned: 
LTR or SV40 promoter, the E. coli lac or trp, the phage lambda P|_ promoter and other promoters known to control 
expression of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector may also contains a 
ribosome binding site for translation initiation and/or a transcription terminator. The vector may also include appropriate 
sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker genes to provide a phenotypic 
trait for selection of transformed host cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell 
culture, or such as tetracycline or ampicillin resistance in E coiL 

The gene can be placed under the control of a promoter, ribosome binding site (for bacterial expression) and, 
optionally, an operator (collectively referred to herein as "control" elements), so that the DNA sequence encoding the 
desired protein is transcribed into RNA in the host cell transformed by a vector containing this expression construction. 
The coding sequence may or may not contain a signal peptide or leader sequence. The polypeptides of the present 
invention can be expressed using, for example, the E co//tac promoter or the protein A gene (spa) promoter and signal 
sequence. Leader sequences can be removed by the bacterial host in post-translational processing. See, e.g., U.S. 
Patent Nos. 4,431,739; 4,425,437; 4.338,397. Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two appropriate vectors are PKK232-8 
and PCM7. Particular named bacterial promoters include lad, lacZ, T3; T7, gpt, lambda Pr, P|_ and trp. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse 
metailothionein-l. Selection of the appropriate vector and promoter is well within the level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences which allow for regulation of the 
expression of the protein sequences relative to the growth of the host cell. Regulatory sequences are known to those 
of skill in the art, and examples include those which cause the expression of a gene to be turned on or off in response 
to a chemical or physical stimulus, including the presence of a regulatory compound. Other types of regulatory elements 
may also be present in the vector, for example, enhancer sequences. 

An expression vector is constructed so that the particular coding sequence is located in the vector with the appro- 
priate regulatory sequences, the positioning and orientation of the coding sequence with respect to the control se- 
quences being such ihat the coding sequence is transcribed under the "control" of the control sequences (i.e., RNA 
polymerase which binds to the DNA molecule at the control sequences transcribes the coding sequence). Modification 
of the coding sequences may be desirable to achieve this end. For example, in some cases rt may be necessary to 
modify the sequence so that it may be attached to the control sequences with the appropriate orientation; i.e., to 
maintain the reading frame. The control sequences and other regulatory sequences may be ligated to the coding 
sequence prior to insertion into a vector, such as the cloning vectors described above. Alternatively, the coding se- 
quence can be cloned directly into an expression vector which already contains the control sequences and an appro- 
priate restriction site. 

Generally, recombinant expression vectors will include origins of replication and selectable markers permitting 
transformation of the host ceil, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRPI gene, and a 
promoter derived from a highly -expressed gene to direct transcription of a downstream structural sequence. The het- 
erologous structural sequence is assembled in appropriate phase with translation initiation and termination sequences, 
and preferably, a leader sequence capable of directing secretion of translated protein into the periplasmic space or 
extracellular medium. Optionally, the heterologous sequence can encode a fusion protein including an N-terminal iden- 
tification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant 
product. 

The vector containing the appropriate DNA sequence as hereinabove described, as well as an appropriate pro- 
moter or control sequence, may be emptoyed to transform an appropriate host to permit the host to express the protein. 

More particularly, the present invention also includes recombinant constructs comprising one or more of the se- 
quences-as-bfoadly-descr4bed^bove.-The-constructs-comprise-a-veclor~such-as-a-plasmid-or-vi 
a sequence of the invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this em- 
bodiment, the construct further comprises regulatory sequences, including, for example, a promoter, operably linked 
to the sequence. Large numbers of suitable vectors and promoters are known to those of skill in the art, and are 
commercially available. The following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 
pQE70, pQE60, pQE-9 (Qiagen), pbs, pDiO, phagescript, psiX174, pbluescript SK, pbsks, pNHBA, pNH16a. pNHISA, 
pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3. pDR540, pRITS (Pharmacia). Eukaryotic: pBlueBaclll (Invitro- 
gen), pWLNEO, pSV2CAT pOG44, pXTl . pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any 
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other plasmid or vector may be used as long as they are replicable and viable in the host. 

Examples of recombinant DNA vectors for cloning and host celts which they can transform include the bacteri- 
ophage X (E. coli), p3R322 (E. colt), pACYC177 (E. coii), pKT230 (gram-negative bacteria), pGVl106 (gram-negative 
bacteria), pLAFRl (gram-negative bacteria). pME290 (non-E. co// gram-negative bacteria). pHVl4 (E. coli and Bacillus 

5 subtilis), pBD9 (Bacillus), plJ61 (Streptomyces), pUC6 (Streptomyces), Ylp5 (Saccharomyces), a baculovirus insect 
cell system, YCp19 (Saccharomyces). See, generally. "DNA Cloning": Vols. I & II, Glover etal. ed. IRL Press Oxford 
(1985) (1987) and: T. Maniatis etaL ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of the polypeptide from the host 
organism, with subsequent cleavage of the secretory signal. 

10 Polypeptides can be expressed in host cells under the control of appropriate promoters. Cell-free translation sys- 

tems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present 
invention. Appropriate cloning and expression vectors tor use with prokaryotic and eukaryotic hosts are described by 
Sambrook, et ai. Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), the 
disclosure of which is hereby incorporated by reference. 

15 Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the 

selected promoter is induced by appropriate means (e.g. , temperature shift or chemical induction) and celts are cultured 
for an additional pehod. 

Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude 
extract retained for further purification. 
20 Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze- 

thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, such methods are well known to those 
skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the present invention may be produced 
by growing host cells transformed by an expression vector described above under conditions whereby the polypeptide 

25 of interest is expressed. The polypeptide is then isolated from the host cells and purified. If the expression system 
secretes the polypeptide into growth media, the polypeptide can be purified directly from the media. If the polypeptide 
is not secreted, it is isolated from cell lysates or recovered from the cell membrane fraction. Where the polypeptide is 
localized to the cell surface, whole cells or isolated membranes can be used as an assayable source of the desired 
gene product. Polypeptide expressed in bacterial hosts such as E. coH may require isclaticn frorri inclusion bodies and 

30 refolding. Where the mature protein has a very hydrophobic region which leads to an insoluble product of overexpres- 
sion, it may be desirable to express a truncated protein in which the hydrophobic region has been deleted. The selection 
of the appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by methods including ammonium 
sulphate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocelluiose chro- 

35 matography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing configuration of the mature 
protein. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the polypeptides of the present in- 
vention may be glycosylated or may be non -glycosylated. Polypeptides of the invention may also include an initial 

40 methionine amino acid residue. 

Preparation of the RNase P RNA Component: 

The RNase P RNA molecules are prepared by run-off in v/fro transcription using T7 RNA polymerase as according 
^5 to standard conditions - usually as recommended by the supplier, e.g., Promega. The plasmid is linearized with an 
appropriate restriction enzyme generating a linear dsDNA comprising the full length gene encoding the RNase PRNA. 
The RNA is purified either from a preparative denaturing acylamide gel or is precipitated prior to use in in Wfrocieavage 
assays. The substrates for the RNase P RNA and the RNA complexed with its protein (RNase P protein) can be 
obtained by in vitro transcription of cloned genes. Useful substrates included but are not limited to pre-tRNA^®* or E. 
50 coli or B. subtilis pre-4.5S molecules and may be expressed using an in vitro transcription system directed by T7 RNA 
polymerase-as-desefibed-abover^-heH=^NA-can-also-be-prepafed-by-automated-synthesis^ 

Antagonists and agonists - assavs and molecules 

55 This invention provides a method of screening drugs to identify those which interfere with the RNA portion, the 

protein portion and/ or the intact RNA/protein complex of the RNase P described herein, which method comprises 
measuring the interference of the activity of the protein and/or RNA by a test drug. For example since the RNA portion 
selected has a catalytic activity, after suitable purification and formulation the activity of th RNA can be followed by 
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its ability to convert its natural or synthetic RNA substrates. By incorporating different chennically synthesized test 
compounds or natural products into such an assay of enzymatic activity one is able to detect those additives which 
compete with the natural or synthetic substrate or otherwise inhibit enzymatic activity 

Polypeptides of the invention may also be used to assess the binding of small molecule substrates and tlgands in, 

5 for example, cells, cell-free preparations^ chemical libraFies, and natural product mixtures. These substrates and ligands 
may be natural substrates and ligands or may be structural or functional mimettcs. See. e.g., Coligan etai, Current 
Protocols in Immunoicgy 1(2): Chapter 5 (1 991 ). 

^ The invention also provides a method of screening compounds to identify those which enhance (agonist) or block 
(antagonist) the a ction of RNaseP polypeptides or polynucleotides, particularly those compoun ds that are bactenostatic ^ 

lo- — STCTor bact eriocidal. The method of screening may involve high-throughput techniques, hor example, to screert toi ^ 

* ^agofTIIts or antagonists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell envelope or cell 
wall, or a preparation of any thereof, comprising RNaseP polypeptide and a labeled substrate or ligand of such polypep- 
tide is incubated in the absence or the presence of a candidate molecule that may be a RNaseP agonist or antagonist. 
The ability of the candidate molecule to agonize or antagonize the RNaseP polypeptide is reflected in decreased binding 

15 of the labeled ligand or decreased production of product from such substrate. Molecules that bind gratuitously, Le., 
without inducing the effects of RNaseP polypeptide are most likely to be good antagonists. Molecules that bind well 
and increase the rate of product production from substrate are agonists. Detection of the rate or level of production of 
product from substrate may be enhanced by using a reporter system. Reporter systems that may be useful in this 
regard include but are not limited to colorimetric labeled substrate converted into product, a reporter gene that is 

20 responsive to changes in RNaseP polynucleotide or polypeptide activity, and binding assays known in the art. 

Another example of an assay for RNaseP antagonists is a competitive assay that combines RNaseP and a potential 
antagonist with RNaseP-binding molecules, recombinant RNaseP binding moieculeS: natural substrates or ligands, or 
substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay. RNaseP can be labeled, 
such as by radioactivity or a colorimetric compound, such that the number of RNaseP molecules bound to a binding 

25 nnolecule or converted to product can be determined accurately to assess the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies that bind to a poly- 
nucleotide or polypeptide of the invention and thereby inhibit or extinguish its activity. Potential antagonists also may 
be small organic molecules, a peptide, a polypeptide such as a closely related protein or antibody that binds the same 
sites on a binding .mclcculG, such as a binding molecule, withoui inducing RNaseP-induced activities, thereby prevent- 

30 ing the action of RNaseP by excluding RNaseP from binding- 
Potential antagonists include a small molecule that binds to and occupies the binding site of the polypeptide thereby 
preventing binding to cellular binding molecules, such that normal biological activity is prevented. Examples of small 
molecules include but are not limited to small organic molecules, peptides or peptide-like molecules. Other potential 
antagonists include antisense molecules (see Okano, J. Neurochem. 56: 560 (1 991 ); OUGODEOXYNUCLEOTIDES 

35 AS ANTISENSE INHIBITORS OF GENE EXPRESSION CRC Press. Boca Raton, FL (1 988). for a description of these 
molecules). Preferred potential antagonists include compounds related to and variants of RNaseP. 

Each of the DNA sequences provided herein may be used in the discovery and development of antibacterial com- 
pounds. The encoded protein, upon expression, can be used as a target for the screening of antibacterial drugs. Ad- 
ditionally, the DNA sequences encoding the amino terminal regions of the encoded protein or Shine-Delgarno or other 

^0 translation facilitating sequences of the respective mRNA can be used to construct antisense sequences to control the 
expression of the coding sequence of interest 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor of the invention to interfere with 
the initial physical interaction between a pathogen and mammalian host responsible for sequelae of infection. In par- 
ticular the molecules of the invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 

^5 bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to extracellular matrix proteins in wounds; 
to block RNaseP protein-mediated mammalian cell invasion by, for example, initiating phosphorylation of mammalian 
tyrosine kinases (Rosenshine etaL, Infect. Immua 60.22^^ (1992); to block bacterial adhesion between mammalian 
extracellular matrix proteins and bacterial RNaseP proteins that mediate tissue damage and; to block the normal pro- 
gression of pathogenesis in infections initiated other than by the implantation of in-dwelling devices or by other surgical 

50 techniques. 

Theantagonists-and-agonists-of^lie^nvenlion-may-be-employedHor-instaneeHo-inhibft-^nd-treat-^i 

as. infections of the upper respiratory tract (e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), lower 
respiratory (e.g., empyema, lung abscess), cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diar- 
rhoea, splenic absces, retroperitoneal abscess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, 

55 keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), kidney and urinary tract (e.g., epididymitis, 
intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., impetigo, folliculitis, cutaneous abscesses, cel- 
lulitis, wound infection, bacterial myositis) bone and joint (e.g., septic arthritis, osteomyelitis). 
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HTP Screening Strategies: 

Assays can be developed to detect compounds that inhibit RNase P directed cleavage o1 RNA substrates. Several 
possible assay fornnats are suitable for HTP screening based upon the ability to incorporate labels within the RNA in 
5 a site-specific fashion by chemical synthesis. The conventional radioactive-based format is preferred, while a homo- 
geneous fluorescence-based format is useful for subsequent follow-up of lead compounds. The use of both formats 
is contemplated by this invention. 

Functional RNase P Assay 

10 

Biotin Is introduced in to an appropriate position within the RNA substrate and the 5' terminus labeled with ^^P. 
The substrate is linked via streptavidin within a 96-well plate. Following RNase P dependent hydrolysis of the substrate, 
the radiolabelled 5' leader cleavage product is released into the bulk solution phase, and subject to scintillation counting. 
Altematively, the RNA substrate is bound to a streptavidin-coated flashplate such that the release of the radioactive 
15 6-mer into the solution phase results in a decrease in signal. This has the advantage that i1 is a homogeneous, con- 
tinuous assay format and requires no additional manipulations after starting the assay. Both formats are useful in the 
practice of this invention because they use the same RNA substrate. 

RNA Fragment Library Rescue 

20 

An effective approach for identifying compounds that interact with RNA is contemplated. The concept is based on 
the over-expression of a drug binding site that is recreated on an RNA fragment, which will sequester the drug and 
permit the continued functioning of the intact ribozyme (Figure 6). This approach has recently been described in the 
context of asearch for ligandsthat bind ribosomal RNA (Howard, B-A, etaL, Bbchem. Cell Bio . 73(11/12): 1161-1166 
25 (1995)). Following selection the random RNA fragments that apparently present a minimal target structure for drug 
recognition, are incorporated into a protocol for rational drug design. Accordingly, random fragment libraries based on 
Ml RNA will be generated and used in HTP screening to identify compounds that disrupt RNA/protein interaction. 
(Figure 7) 

30 Cyclic peptide phage libraries: 

The incorporation of conformational constraints into flexible lead compounds is a powerful strategy to increase 
lead potency and is particularly useful in the field of peptidomimetic design. ( Al-Obeidi, F. ai, J. Med. Chem. 32: 
2555-2561 (1 989);Barker, PL, et al., J. Med. Chem. 35: 2040-2048 (1 992)). Cyclization has been shown to increase 
55 the propensity for beta-turn formation in peptides the potential of which has been demonstrated by the identification 
of high-affinity ligands for streptavidin (Lee, M. 8., etaL, FEBS Lett 359: 113-118 (1995)). 

In this case, cyclic peptide libraries were constructed with flanking cysteine residues to allow efficient disulfide bond 
formation and cyclization during phage assembly. The streptavidin bound crystal structures of two disulfide bridged 
cyclic peptides showed both peptides to be in beta-turn conformations (Kahn, M. (Guest, Ed., 1993) Tetrahedron 49, 

40 Symp. 50, 3433-3677). 

Beta-turns are key recognition elements in many biological interactions therefore effort has been focused on the 
design of small constrained beta-turn mimics (Kahn. M. (Guest Ed., 1993) Tetrahedron 49, Symp. 50, 3433-3677). This 
approach, when applied to RNase P, could identify cyclic peptides suitable for peptide mimic synthesis as inhibitor 
molecules. A cyclic octapeptide phage display library may be constructed and used to identify peptides that interact 

45 with defined RNA domains. 

Secondary Evaluation 

SELEX: Systematic Evolution of Ligands by Exponential Enrichment: 

50 

This~approach-nrTay-be-emptoyediri-an-attempt-toidentify-RNA-recognition-motifs-for-RNase-P-R^ 

protein binding for structural analysis as an aid to rational drug design and the secondary evaluation of compounds 
identified via the HTP screens. The technology is based on the repeated selection and amplification of RNA fragments 
that specifically bind to a protein with high affinities (Szostak, J.W., TIBS 17: 89-93 (1992)). Fragment libraries based 

55 on the S. aureus and E. cofi RNase P RNAs may be constructed for the in vitro synthesis of RNA fragments and the 
subsequent selection of molecules that bind their respective proteins. Chemical and enzymatic structure probing tech- 
nologies may be emptoyed in combination with protein/RNA protection studies to map the interactive sites. SELEX 
based on the resulting RNA f ragment(s) may be further exploited to determine the minimal structural requirements for 
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RNA recognition. 

Disruption of RNase P Assembly 

The identification of a prolein/RN A-fragment pair permits the development of a screen for compounds that disrupt 
their assembly. Drug induced disruption o1 labeled RNA bound to immobilized protein (biotin/streptavidin) would result 
in the concomitant decrease/loss of the signal generated by the presence of the RNA (Figure 6 ). 

RNA/Druq Interactions 

RNase P RNA fragments that confer drug resistance (RNA Fragment Rescue Library supra) may be sequenced 
and expressed in vitro for chemical and enzymatic structure probing in the presence and absence of the drug in an 
attempt to map the binding site. SELEX may be applied to lead compounds in an attempt to identify the minimal struc- 
tural requirements for drug binding. 

RNase P Substrates 

Minimal RNA substrates may be chemically synthesized for HTP screening including both pre-tRNA and pre-4.5S 
RNA derivatives (e.g. Figure 8). RNA-ligand interactions involving ribose 2'-hydroxyl groups of specific nucleotides 
may be probed via chemical synthesis of the appropriately modified RNA fragment. In order to retain the C^'-endo 
configuration characteristic of ribonucleotides, 2'-methoxy and 2'-fluororibonucleotides analogues can be used, the 
latter being preferred on steric grounds. Nucleotides lacking a 2'-substituent adopt the undesired C^-endo configura- 
tion. 

The invention also relates to inhibitors identified by any of the techniques described herein. Because of the enzy- 
matic nature of RNase P action, it is appreciated that inhibitors may be identified which act as transition state mimics, 
inhibitors of product release or inhibitors of substrate binding. 

Diagnostic Assays 

This invention is also related to the use of the RNaseP polynucleotides of the invention tor use as diagnostic 
reagents. Detection of RNaseP in a eukaryote, particularly a mammal, and especially a human; will provide a diagnostic 
method for diagnosis of a disease. Eukaryotes (herein also ''individual(s)''), particularly mammals, and especially hu- 
mans, infected with an organism comprising the RNaseP gene may be detected at the nucleic acid level by a variety 
of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and tissues, such as bone, blood, 
muscle, cartilage, and skin. Genomic DNA may be used directly for detection or may be amplified enzymatically by 
using PGR or other amplification technique prior to analysis. RNA or cDNA may also be used in the same ways. Using 
amplification, characterization of the species and strain of prokaryote present in an individual, may be made by an 
analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a change in size of the 
amplified product in comparison to the genotype of a reference sequence. Point mutations can be identified by hybrid- 
izing amplified DNA to labeled RNaseP polynucleotide sequences. Perfectly rriatched sequences can be distinguished 
from mismatched duplexes by RNase digestion or by differences in melting temperatures. DNA sequence differences 
may also be detected by alterations in the electrophoretic mobility of the DNA fragments in gels, with or without dena- 
turing agents, or by direct DNA sequencing. See, e.g., I^/lyers et al., Science^ 230: 1242 (1 985). Sequence changes at 
specific locations also may be revealed by nuclease protection assays, such as RNase and SI protection or a chemical 
cleavage method. See, e.g., Cotton etal.. Proc. Nad. Acad. Sa\, USA, S5; 4397-4401 (1985). 

Cells carrying mutations or polymorphisms in the gene of the invention may also be detected at the DNA level by 
a variety of techniques, to allow for serotyping, for example. For example, RT-PGR can be used to detect mutations. 
It is particularly preferred to used RT-PGR in conjunction with automated detection systems, such as, for example, 
GeneScan. RNA or cDNA may also be used for the same purpose, PGR or RT-PGR. As an example, PGR primers 
com piemen tary4G-a-nuGt8iG-aGid-enGGK^ing-RNaseP-Gan-be4Jsed4o kdentif y-and-analyze-mutatiGns^^ 
resentative primers are shown in the Examples. The invention further provides these primers with 1 , 2, 3 or 4 nucleotides 
removed from the 5' and/or the 3' end. These primers may be used for, among other things, amplifying RNaseP DNA 
isolated from a sample derived from an individual. The primers may be used to amplify the gene isolated from an 
infected individual such that the gene may then be subject to various techniques for elucidation of the DNA sequence. 
In this way, mutations in the DNA sequence may be detected and used to diagnose infection and to serotype and/or 
classily the infectious agent. 

The invention further provides a process for diagnosing, disease, preferably bacterial infections, more preferably 
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infections by Staphylococcus aureus, and most preferably disease, such as, infections of the upper respiratory tract 
(e.g., otitis media, bacterial tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), 
cardiac (e.g., infective endocarditis), gastrointestinal (e.g., secretory diarrhoea, splenic absces, retroperitoneal ab- 
scess), CNS (e.g., cerebral abscess), eye (e.g., blepharitis, conjunctivitis, keratitis, endophthalmitis, preseptal and 

5 orbital cellulitis, darcryocystitis), kidney and urinary tract (e.g., epididymitis, intrarenal and perinephric absces, toxic 
shocl^ syndrome), skin (e.g.. impetigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, bacterial myositis) 
bone and joint (e.g., septic arthritis, osteomyelitis), compnsing determining from a sample derived from an individual 
a increased level of expression of polynucleotide having the sequence of Table 1 [SEQ ID NO: 1], Increased or de- 
creased expression of RNaseP polynucleotide can be measured using any on of the methods well known in the art for 

10 the quantitation of pclynucleotides, such as, for example, amplification, PGR, RT-PCR, RNase protection. Northern 
blotting and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over-expression of RNaseP protein 
compared to normal control tissue samples may be used to detect the presence of an infection, for example. Assay 
techniques that can oe used to determine levels of a RNaseP protein, in a sample derived from a host are well-known 

15 to those of skill In the art. Such assay methods include radioimmunoassays, competitive-binding assays, Western Blot 
analysis and ELISA assays. 

Antibodies 

20 The polypeptides of the invention or variants thereof = or cells expressing them can be used as an immunogen to 

produce antibodies immunospecific for such polypeptides. "Antibodies" as used herein includes monoclonal and pol- 
yclonal antibodies, chimeric, single chain, simianized antibodies and humanized antibodies, as well as Fab fragments, 
including the products of an Fab immunolglobulin expression library. 

The Fab fragment may also be prepared from its parent monoclonal antibody by enzyme treatment, for example 
25 using papain to cleave the Fab portion from the Fc portion. 

Antibodies generated against the polypeptides of the invention can be obtained by administering the polypeptides 
or epitope-bearing fragments, analogues or cells to an animal, preferably a nonhuman, using routine protocols. The 
antibody so obtained will then bind the polypeptides itself. In this manner, even a sequence encoding only a fragment 
of the polypeptides can be used to generate antibodies binding the whole native polypeptirie-?. Such antibodies can 
30 then be used to isolate the polypeptide from tissue expressing that polypeptide. For preparation of monoclonal anti- 
bodies, any technique known in the art that provides antibodies produced by continuous eel! line cultures can be used. 
Examples include various techniques, such as those in Kohler, G. and Mllstein, C, Nature 255:495-497 (1 975); Kozbor 
etaL, Immunology Today 4: 72 (1983); Cole et al., pg. 77-96 in MONOCLONAL ANTIBODIES AND CANCER THER- 
APY, Alan R, Liss. Inc. (1 985), )), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole, 
55 et aL, 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

The hybridomas are screened to select a cell line with high binding affinity and favorable cross reaction with other 
staphylococcal species using one or more of the original polypeptide and/or the fusion protein. The selected cell line 
is cultured to obtain the desired Mab. 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be adapted to produce 
40 single chain antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such as other mam- 
mals, may be used to express humanized antibodies. 

Alternatively phage display technology may be utilized to select antibody genes with binding activities towards the 
polypeptide either from repertoires of PCR amplified v-genes of lymphocytes from humans screened for possessing 
anti-RNaseP or from naive libraries (McCafferty. J. et aL, (1990). Nature 348, 552-554; Marks, J. et al.. (1992) Bio- 
45 technology 10, 779-733). The affinity of these antibodies can also be improved by chain shuffling (Clackson, T. et aL, 
(1991) Nature 352, 624-628). 

The antibody should be screened again for high affinity to the polypeptide and/or fusion protein. 
As mentioned above, a fragment of the final antibody may be prepared. 

The antibody may be either intact antibody of f^/l^ approx 150,000 or a derivative of it, for example a Fab fragment 
so or a Fv fragment as described in Skerra, A and Pluckthun, A., Science 240:1038-1040 (1968). If two antigen binding 
domains are present each^oiriain may be directed against a diffefent"ep?tope - lenTied"*&ispeciRc'"aTilib'odies: 

If two antigen binding domains are present each domain may be directed against a different epitope - termed'bis- 
pecific' antibodies. 

In panicular derivatives which are slightly longer or slightly shorter than the native protein or polypeptide fragment 
55 cf the present invention may be used. In addition, polypeptides in which one or more of the amino acid residues are 
modified may be used. Such peptides may. for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications are generally well known to 
those skilled in the art of peptide chemistry. 
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The above -described antibodies may be employed to isolate or to identify clones expressing the polypeptides to 
purify the polypeptides by affinity chromatography 

Thus, among others, antibodies against RNaseP- polypeptide may be employed to treat infections, particularly 
bacterial infections and especially disease, such as, infections of the upper respiratory tract (e.g., otitis media, bacterial 
5 tracheitis, acute epiglottitis, thyroiditis), lower respiratory (e.g., empyema, lung abscess), cardiac (e.g., infective endo- 
carditis), gastrointestinal (e.g., secretory diarrhoea, splenic absces, retroperitoneal abscess), CNS (e.g., cerebral ab- 
scess), eye (e.g., blepharitis, conjunctivitis, keratitis, endophthalmitis, preseptal and orbital cellulitis, darcryocystitis), 
kidney and urinary tract (e.g.. epididymitis, intrarenal and perinephric absces, toxic shock syndrome), skin (e.g., im- 
petigo, folliculitis, cutaneous abscesses, cellulitis, wound infection, bacterial myositis) bone and joint (e.g., septic ar- 
10 thritis, osteomyelitis). 

Preferably the antibody is prepared by expression of a DNA polymer encoding said antibody in an appropriate 
expression system such as described above for the expression of polypeptides of the invention. The choice of vector 
for the expression system will be determined in part by the host, which may be a prokaryotic cell, such as E. coli 
(preferably strain B) or Slreptomyces sp. or a eukaryotic cell, such as a mouse CI 27, mouse myeloma, human HeLa, 

IS Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The host may also be a transgenic animal or a 
transgenic plant (for example, as described in Hiatt, A. et al. Nature 340:76-78(1989). Suitable vectors include plas- 
mids. bacteriophages, cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 

Polypeptide variants include antigen ically, epitopically or immunologicaily equivalent variants that form a particular 
aspect of this invention. The term "antigenically equivalent derivative" as used herein encompasses a polypeptide or 

20 its equivalent which will be specifically recognized by certain antibodies which, when raised to the protein or polypeptide 
according to the inventkDn, interfere with the immediate physical interaction between pathogen and mammalian host. 
The term "immunologically equivalent derivative" as used herein encompasses a peptide or its equivalent which when 
used in a suitable formulation to raise antibodies in a vertebrate, the antibodies act to interfere with the immediate 
physical interaction between pathogen and mammalian host. 

25 The polypeptide, such as an antigenically or immunologically equivalent derivative or a fusion protein thereof is 

used as an antigen to immunize a mouse or other animal such as a rat or chicken. The fusion protein may provide 
stability to the polypeptide. The antigen may be associated, for example by conjugation, with an immunogenic carrier 
protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). Alternatively a multiple anti- 
gen ic peptide comprising multiple copies of the protein or polypeptide, or an antigenically cr innmunclcgically equrvalent 

30 polypeptide thereof may be sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic in the individual. For example, 
if the individual is human the antibody may most preferably be "humanized"; where the complimentarity determining 
region(s) of the hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for example 
as described in Jones, R et aL (1956), Nature 321. 522-525 or Tempest et al., (1991) Biotechnology 266-273. The 

35 humanized monoclonal antibody, or its fragment having binding activity, form a particular aspect of this invention. 

The modification need not be restricted to one of "humanization"; other primate sequences (for example Newman, 
R. et aL, Biotechnology 1 0: 1 455-1 460 ( 1 992)) may also be used. 

The use of a polynucleotide of the invention in genetic immunization will preferably employ a suitable delivery 
method such as direct injectbn of plasmtd DNA into muscles (Wolff et al., Hum Mol Genet 1 992, 1:363, Manthorpe et 

40 al.. Hum. Gene Ther. 1 963:4, 419), delivery of DNA complexed with specific protein carriers (Wu et al., J Biol Chem, 
1 989: 264,16985), coprecipitation of DNA with calcium phosphate (Benvenisty & Reshef, PNASUSA 1986:83,9551). 
encapsulation of DNA in various forms of liposomes (Kaneda et al., Science 1989:243,375), particle bombardment 
(Tang et al., Nature 1992, 356:152, Eisenbraun et al., DNA Cell Bioll Q93, 12:791) and in vivo infection using cloned 
retroviral vectors (Seeger et aL. PNAS USA 1984:81,5849). 

45 

Vaccines 

Another aspect of the invention relates to a method for inducing an immunological response in an individual, par- 
ticularly a mammal which comprises inoculating the individual with RNaseP, or a fragment or variant thereof, adequate 

50 to produce antibody and/ or T cell immune response to protect said individual from infection, particularly bacterial 

[nfection~and most~particu1arly-S/ap/7ytococcus"auret/s"infection— Also provided-are*^ 

logical response slows bacterial replication. Yet another aspect of the invention relates to a method of inducing immu- 
nological response in an individual which comprises delivering to such individual a nucleic acid vector to direct expres- 
sion of RNaseP. or a fragment or a variant thereof, for expressing RNaseP, or a fragment or a variant thereof in vivo 

S5 in order to induce an immunological response, such as, to produce antibody and/ or T ceil immune response, including, 
for example, cytokine -producing T ceils or cytotoxic T cells, to protect said individual from disease, whether that disease 
is already established within the individual or not. One way of administering the gene is by accelerating it into the 
desired cells as a coaling on particles or otherwise. Such nucleic acid vector may comprise DNA, RNA, a modifi d 
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nucleic acid, cr a DNA/RNA hybrid. 

A further aspect of the invention relates to an immunological composition which, when introduced into an individual 
capable or having induced within it an immunological response, induces an immunological response in such individual 
to a RNaseP or protein coded therefrom, wherein the composition comprises a recombinant RNaseP or protein coded 

5 therefrom comprising DNA which codes for and expresses an antigen of said RNaseP or protein coded therefrom. The 
immunological response may be used therapeutically or prophylactically and may take the form of antibody immunity 
or cellular immunity such as that arising from CTL or CD4+ T cells. 

A RNaseP polypeptide or a fragment thereof may be fused with coiDrotein which may not by itself produce anti- 
bodies, but is capable of stabilizing the first protein and producing a fused protein which will have immunogenic and 

10 protective properties. Thus fused recombinant protein, preferably further comprises an antigenic co-protein, such as 
lipoprotein D from Hemophilus influenzae, Glutathione-S-transferase (GST) or beta-galactosidase, relatively large co- 
proteins which solubilize the protein and facilitate production and purification thereof. Moreover, the co-protein may 
act as an adjuvant in the sense of providing a generalized stimulation of the immune system. The co-protein may be 
attached to either the amino or carboxy temninus of the first protein. 

15 Provided by this invention are compositions, particularly vaccine compositions, and methods comprising the 

polypeptides or polynucleotides of the invention and immunostimulatory DNA sequences, such as those described in 
Sato, Y. etaL Science 273: 352 (1996). 

Also, provided by this invention are methods using the described polynucleotide or particular fragments thereof 
which have been shewn to encode non-variable regions of bacterial cell surface proteins in DNA constructs used in 

20 such genetic immunization experiments in animal models of infection with Staphylococcus aureus will be particularly 
useful for identifying protein epitopes able to provoke a prophylactic or therapeutic immune response. It is believed 
that this approach will allow tor the subsequent preparation of monoclonal antibodies of particular value from the req- 
uisite organ of the animal successfully resisting or clearing infection for the development of prophylactic agents or 
therapeutic treatmenis of bacterial infection, particularly Staphylococcus aureus infection, in mammals, particularly 

25 humans. 

The polypeptide may be used as an antigen for vaccination of a host to produce specific antibodies which protect 
against invasion of bacteria, for example by blocking adherence of bacteria to damaged tissue. Examples of tissue 
damage include wounds in skin or connective tissue caused, e.g., by mechanical chemical or thermal damage or by 
impiantauon of ii idweliing devices, or wounds in the mucous membranes, such as the mouthy mammary glands, urethra 
30 or vagina. 

The invention also includes a vaccine formulation which comprises an immunogenic recombinant protein of the 
invention together with a suitable carrier Since the protein may be broken down in the stomach, it is preferably admin- 
istered parenterally, including, for example, administration that is subcutaneous, intramuscular, intravenous, or intra- 
dermal. Formulations su itable for parenteral administration include aqueous and non-aqueous sterile injection solutions 

55 which may contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation insotonic with the 
bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous sterile suspensions which may 
include suspending agents or thickening agents. The formulations may be presented in unit<lose or multi-dose con- 
tainers, for example, sealed ampules and vials and may be stored in a f reeze-dried condition requiring only the addition 
of the sterile liquid carrier immediately prior to use. The vaccine formulation may also include adjuvant systems for 

^0 enhancing the immunogenicity of the formulation, such as oil-in water systems and other systems known in the art. 
The dosage will depend on the specific activity of the vaccine and can be readily determined by routine experimentation. 

While the invention has been described with reference to certain RNaseP protein, it is to be understood that this 
covers fragments of the naturally occurring protein and similar proteins with additions, deletions or substitutions which 
do not substantially affect the immunogenic properties of the recombinant protein. 

45 

Compositions, kits and administration 

The invention also relates to compositions comprising the polynucleotide or the polypeptides discussed above or 
their agonists or antagonists. The polypeptides of the invention may be employed in combination with a non-sterile or 

50 sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical carrier suitable for admin- 
istration to a subject. Such compositions comprise, for instance, a media adcitfive or a therapeutically^effective amount" 
of a polypeptide of the invention and a pharmaceutically acceptable carrier or excipient. Such carriers may include, 
but are not limited to, saline, buffered saline, dextrose, water, glycerol, ethanol and combinations thereof. The formu- 
lation should suit the mode of administration. The invention further relates to diagnostic and pharmaceutical packs and 

55 kits comprising one or more containers filled with one or more of the ingredients of the aforementioned compositions 
of the invention. 

Polypeptides and other compounds of the invention may be employed alone or in conjunction with other com- 
pounds, such as therapeutic compounds. 
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The pharmaceutical compositions may be administered in any effective, convenient manner including, for instance, 
administration by topical, oral, anal, vaginal, intravenous, intraperitoneal, intramuscular subcutaneous, intranasal or 
intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as an injectable composition, 
5 for example as a sterile aqueous dispersion, preferably isotonlc. 

Alternatively the composition may be formulated for topical application for example in the form of ointments, creams, 
lotions, eye ointments, eye drops, ear drops, mouthwash, impregnated dressings and sutures and aerosols, and may 
contain appropriate conventional additives, including, for example, preservatives, solvents to assist drug penetration, 
and emollients in ointments and creams. Such topical formulations nnay also contain compatible conventional carriers, 
10 for example cream or ointment bases, and ethanol or oleyl alcohol for lotions. Such carriers may constitute from about 
1% to about 98% by weight of the formulation; more usually they will constitute up to about 80% by weight of the 
formulation. 

For administration to mammals, and particularly humans, it is expected that the daily dosage level of the active 
agent will be from 0 01 mg/kg to 10 mg/kg, typically around 1 mg/kg. The physician in any event will determine the 
'5 actual dosage which will be most suitable for an individual and will vary with the age, weight and response of the 
particular individual. The above dosages are exemplary of the average case. There can. of course, be Individual in- 
stances where higher or lower dosage ranges are merited, and such are within the scope of this invention. 

In-dwelling devices include surgical implants, prosthetic devices and catheters, i.e., devices that are introduced 
to the body of an individual and remain in position for an extended time. Such devices include, for example, artificial 
20 joints, heart valves, pacennakers, vascular grafts, vascular catheters, cerebrospinal fluid shunts, urinary catheters, 
continuous ambulatory peritoneal dialysis (CAPD) catheters. 

The composition of the invention may be administered by injection to achieve a systemic effect against relevant 
bacteria shortly before insertion of an in-dwelling device. Treatment may be continued after surgery during the in-body 
time of the device. In addition, the composition could also be used to broaden perioperative cover for any surgical 
25 technique to prevent bacterial wound infections, especially Staphylococcus aureus wound infections. 

Many orthopaedic surgeons consider that humans with prosthetic joints should be considered for antibiotic proph- 
ylaxis before dental treatment that could produce a bacteremia. Late deep infection is a serious complication sometimes 
leading to loss of the prosthetic joint and is accompanied by significant morbidity and mortality. It may therefore be 
possible to extend the use of the active agent as a replacement for prophylactic antibiotics in this situation, 
30 In addition to the therapy described above, the compositions of this invention may be used generally as a wound 

treatment agent to prevent adhesion of bacteria to matrix proteins exposed in wound tissue and for prophylactic use 
in dental treatment as an alternative to, or in conjunction with, antibiotic prophylaxis. 

Alternatively, the composition of the invention may be used to bathe an indwelling device immediately before in- 
sertion. The active agent will preferably be present at a concentration of 1ug/ml to 10mg/ml for bathing of wounds or 
55 indwelling devices. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be employed to enhance 
the immune response. A suitable unit dose tor vaccination is 0.5-5 microgrann/kg of antigen, and such dose is preferably 
administered 1-3 times and with an interval of 1-3 weeks. With the indicated dose range, no adverse toxicological 
effects will be observed with the compounds of the invention which would preclude their administration to suitable 
^0 individuals. 

Each reference disclosed herein is incorporated by reference herein in its entirety. Any patent application to which 
this application claims priority is also incorporated by reference herein in its entirety. 

Examples 

45 

The present invention is further described by the following examples. The examples are provided solely to illustrate 
the invention by reference to specific embodiments. These exemplifications, while illustrating certain specific aspects 
of the invention, do not portray the limitations or circumscribe the scope of the disclosed invention. 

All examples were carried out using standard techniques, which are well known and routine to those of skill in the 
50 art, except where otherwise described in detail. Routine molecular biology techniques of the following examples can 

be-car^ied-Gut-as-described^n-standard^aborato^y-manualsrSueh-as-Sambrook-et-al— A/f0LECyL/\^^ 

LABORATORY MANUAL, 2nd Ed.: Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (19S9). 
All parts or amounts set out in the following examples are by weight, unless otherwise specified. 
Unless otherwise stated size separation of fragments in the examples below was carried out using standard tech- 
55 niques of agarose and polyacrylamide gel electrophoresis ('PAGE") in Sambrook et ai., MOLECULAR CLONING. A 
LABORATORY MANUAL, 2ndEd., Cold Spring Harbor Laboratory Press. Cold Spring Harbor N.Y. (1989) andnumer- 
ous other references such as. for instance, by Goeddel et al., Nucleic Acids Res, 8: 4057 (1980). 
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ExampI 1. PGR with random primer 



This method describes a rapid way to obtain additional sequence data from partial gene fragments independent 
of probing a genomic library. 

5 Random sequencing of a S. aureus genomic library followed by sequence homology searching with the B. subtilis 

P-protein sequence resulted in the identification of a 324 base pair (herein "bp") fragment. The first 193 nucleotides 
of this fragment showed significant homology to the C-terminal half of the B. subtilis P-protein and other prokaryotic 
RNase P proteins. The N-terminal domain of the putative S. aureus RNase P protein was putatively missing. 

Deduced S. aureus RNase P protein sequence versus 6. subtilisshows 58.7% similarity, 34.9% identity (see Figure 

10 2 for an alignment of RNase P protein sequences). 

A novel two step PGR with two different reverse primers complementary to the known sequence in position 15-39 
and 194-215 respectively (shown below, Table 2) as well as random hexamer primers (Gibco) was used to obtain the 
complete 5' sequence of the S. aureus spp gene. S. aureus genomic DMA partially digested with Hind III or Pst I served 
as a template. Primer #1 (position 194-215) annealed just downstream of the stop codon TAA and primer #2 (position 

15 15-39) close to the end of the known sequence (see Table 2). 
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Table 2 
Primer Position 

5 ' OAATT CCCTO TTAAO AAACA XQXTT AAAAG AGCAA TACCT GAAXX TTTCA XAGTA CXTAA GTCGM 

3'C TTTGT TCTAA TTTTC TCGTT ATGC 5' CSEQ ID NO: 7] 
ATATA frOGK CAAAG ATATT ATTOM TATAO NAAOA CAOGC AGCTA AAGAT ATGAC GACTT TACAA 



ATACA GNATA GTCTT GAOCA COTNC TTAAA ATTOQ CAAAG TTTTT AATAA AAAGA TTAAG TAAGG 

3'CC 

ATAGG GTAGG GAATG AAAAC ATTAA CCCCT CAACG CATCC CGAAG TCTTA CCTCA GACAA ACGTT 
TATCC CATCC CTTAC ZTTTG 5' [SEQ ID NO: 8] 

AGACT GACCC TAGGG TTAAG ACTTG GCCCN AGGGN TATNN 7AAC7 TACTT TAAAA TGTTT TCAC 3' 



30 In the first step, the reverse strand encoding the C-terminus and upstream the unknown N-terminal sequence of 

the protein was amplified using primer #1, resulting in single stranded products as diagrammed in Figure 9. This am- 
plification of the desired sequence favoured the binding of random primers to that sequence in the next step. 

In the second step, 32P-€nd-labeled primer #2 and random hexamers were added and the annealing temperature 
was dropped to 25**C to allow the short random primers to anneal to the DNA, 

35 The PGR is expected to generate many different fragments, however the major products should be those primed 

by primer #2 and a random primer (see below). Only these products were of interest as they should encode the N- 
terminal half of the protein and they could be monitored since they would be radiolabeled. After separation of the PGR 
products by gel electrophoresis and exposure to film, a limited number of radiolabeled bands should be visible on the 
autoradiogram. This method is diagrammed in Figure 10. These fragments can subsequently be cloned into a suitable 

40 vector, e.g., pUCl9, and sequenced using routine methods. 

Example 2. PGR of the spp 5' domain with random primers 

The 5' half of the spp gene was amplified in a two step PGR reaction as described herein. The PGR products were 
45 cleaned using a QlAquick PGR purification column and recovered in 50 ^il water. The sample was concentrated to a 
final volume of 25 )il under vacuum in a SpeedVac® SG110 (Savant). 20^l were loaded onto a 1 .6 mm, 8% polyacry- 
lamide gel containing 7M urea and analyzed by gel electrophoresis. The gel was stained with ethidium bromide (1 \ig/ 
ml in HgO) to visualize the DNA and photographed. The gel was then dried and exposed to film. 

There were several defined bands visible both on the ethidium bromide stained gel as well as on the autoradiogram. 
50 They ranged in size between 50 bp to over 2000 bp, some bands being more intense than others. The most prominent 
b"an'ds~^ivere~approximaie1y"50bpn'50bpr400bp~and'800bpin-size^ 

Example 3. Shotgun cloning of the PGR fragments: Cloning and sequencing of the PGR fragments 

55 The PGR fragments were blunt end cloned into Smai cut pUG19. Ligations of 100 ng vector DNA with 1.0 \i\ and 

1.5 |il of PGR products wore performed at 16'G overnight, E. co// XL1 blue cells (a-complementing) were transformed 
by electroporation and plated onto selective plates containing IPTG and x-gal (for blue/white screening). 
Transformation of these cells resulted in approximately 8.5% white colonies in the presence of PGR products, whereas 
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only 4.6% white colonies could be found in the control ligation without insert. White colonies had lost the ability to fomn 
the p-galactostdase holoenzyme as a result of a reading frame disruption in the lacZ'gene and wer<B no longer able to 
cleave x-gal. The reading frame was either disrupted by an insert (in the presence of PGR products) or exonuclease 
activity of the restriction enzyme preparation, destroying the Sma I site of the vector by cleaving terminal nucleotides 
5 and resulting in frame shifts after ligation. 

18 white colonies were picked and grown up as 2 ml cultures to prepare plasmids. A sample of each plasmid was 
linearized with AccI restriction endonuclease (cuts in the multi cloning site) and run on a 1% TAE agarose gel alongside 
the original pUCl9. 

The results showed that 7 plasmids released a short fragment upon restriction digest. As there is only one AccI 
10 site in pUC19, these recombinant plasmids must contain an insert with a restriction site tor this enzyme. 3 plasmids 
appeared larger than pUC 19 but did not release a fragment. The other 6 plasmids were of the same size as pUC19 
and did not appear to have an insert. 

The 10 plasmids containing an insert were sequenced both with the forward and reverse pUC/M13 sequencing 
primers. 

15 5 of the 6 plasmids that released a fragment after AccI digest harboured an 213bp insert that started with the 

primer #2 sequence and the known 14bp followed by a sequence of 174 nucleotides that was common to all of the 
plasmids (Sequence 1. Table 3). In one of the 6 plasmids the insert (Sequence 2. Table 3) was slightly different; the 
first 137 nucleotides were in common with Sequence I (dotted line), however the insert sequence extended beyond 
this and deviated from Sequence I (see Table 3): 

20 

Tabic 3 

25 Aplification Product Sequence 1, 2 and 3 

Sequence 1 [SEQ ID NO: 9) 

S'GTTTT CATTC CCTAC CCTAT CCTTA CTTAA TCTTT TTATT 
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GAATG 


CAGAT 


TTTCA 


GAGAA 


TATAT 


AAAAA 


AGGTC 


ATTCT GTAGC 


CAACA 


GACAA 


TTTGT 


TGTAT 


ACACT 


TGTAA 


TAATA 


AAGAA ATAGA 


CCATT 


TTCGC 


TTAGG 


TATTA 


GTGTT 


TCTAA 


AAAAC 


TAanr AATGC 


ACiTST 


TAAGA 


ACAAG 


ATTAA 


AAGAG 


CAATA 


CGT 3 


1 
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Sequence 2 (SEQ ID NO: 10) 

5 ' ACGAT AAATA ACAGT AllSTT ATTGG AAAAA GTCTA CCGAA TTAAA 

AAGAA TGCAG ATTTT CAGAG , AATAT M 
CAGAC AATTT GTTGT ATACA,,.CT^^ 

GCTTA GGTAT TAr.jn GCAGT GTTAA GAACA 

AGATT AAAAG AGCAA TACGT 3' 

Sequences [SEQ ID NO: 11] 

5 ' . . . GAAAA TTTCA AAGTA CATAA GTCGC ATATA TTGGC CAAAG ATTGT 
AATAG CAAGA CAGCC AGCTA AAGAT ATGAC GACTT TACAA ATACA GAATA 
GTCTT GAGCA CGTAC TTAAA ATTGC CAAAG TTTTT AATAA AAAGA TTAAG 
TAA GG ATAGG GTAGG GAATG AAAAC 3 ' 

Stop codon (underlined) 

Primer #2 sequence is shown in bold and also the known 14 bases underlined (3 changes to known sequence in 
italic). 

The predicted translation products of Sequence 1 and 2 showed significant homology to B. subtiiis P-protein and 
indicated that these inserts contain the full sequence information for the 5' domain of the spp gene. 

In 2 plasmlds the insert started with the sequence of primer ft^ fcllcwcd by 3' half of the spp geno (Sequence 3, 
Table 3). This sequence was already known but the new sequence showed 9 base changes (shown in bold) in com- 
parison to the original sequence data obtained by random sequencing of the S. aureus library. 

2 plasmlds contained inserts with different sequences that are a result of unspecific priming during PGR. 

Example 4 Analysis of the sequence data 

The newly obtained sequence data was drawn together and predicted to represent the complete sequence of the 
spp structural gene. The reading frame was determined from a partial sequence of the 3* domain. According to this 
reading frame the start codon ATG could be identified 174bp upstream of primer jf2 sequence (see above). The total 
length of the gene is 354bp. 

The nucleotide sequence of the spp gene is shown in Table 1 [SEQ ID NO: 2]; the start codon (ATG) and stop 
codon (TAA) are both shown in bold. The deduced translation product of the spp gene (SP protein) is 117 amino acids 
is also shown in Table 1 [SEQ ID NO:2]. 

The translated sequence was aligned with known amino acid sequences of the RNase P protein from 5 different 
prokaryotes. These alignments are shown inFigure2.TheS. aureus SP protein sequence matched the B. subtiiis P- 
protein sequence well, significant homology to the B. subtiiis protein. 

Example 5. PGR of the spp gene 

The newly obtained sequence data was used to design PGR primers for the amplification of the full length spp 
structural gene. 

The-PCRTesuftedin-a-Blnglei}and-of-approximate1y-36Gbp-with-anotali3fsJA-yie1d-o 

trophotometrical measurement. The DNA fragment was cleaned using a QIAquick PGR purification column and recov- 
ered in eOjjl water. 

Example 6. Cloning spp into pMalc2 

The pMalc2 vector provides a method for expressing and purifying a protein produced from a cloned gene (New 
England BioLabs) (see also. Guan. C., Li, P, Riggs, RD. and Inouye. H. (1987) Gene 67, 21-30; Maina, C.V. et aL 
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(1988) Gene 74, 365-373: Riggs, R in Ausubel, F.M. et al. (eds) Current prot. in Molecular Biol. (1992): Kellerman and 
Ferenci (1982) Methods in Enzymol. 90, 459-463: Yanlsch-Perron, C. et al (1985) Gene 33, 103-119; Zagursky. RJ. 
and Berman, M.L. (1984) Gene 27, 183-191, regarding the vector and its uses). 

The cloned gene is inserted downstream from the malE gene of E. coli, which encodes the maltose-binding-protein 

5 (MBP), resulting in the expression of an MBP fusion protein. The method uses the strong "tac" promolor under the 
control of the lac repressor and a one step purification of the fusion protein using MBP's affinity for maltose. Unique 
restriction sites are available between malE and lacZa for inserting the gene of interest, allowing blue/white screening 
of transformants. The vector also contains a recognition site for the specific Factor Xa protease located just 5' to the 
polylinker. This allows MBP to be cleaved after purification without leaving any vector derived residues attached to the 

10 protein of interest when the gene is cloned into the Xmni site. 

Example 7. Ligation of the spp PCR product into pMalc2 

1 ^g of the protein fusion vector pMalc2 was cut with XmnI and BannHI restriction endonucleases to enable direc- 
15 tional cloning of the spp structural gene. To remove the small 15 bp insert the digest was cleaned using QIAquick PCR 
purification column and the DNA recovered in 90 |il water. A sample was analysed by agarose gel electrophoresis to 
ensure complete digestion of the vector. After 30 min of incubation at 37*C with both enzymes the vector was completely 
cut. 

The spp PCR product was digested with BamHI and cleaned using a QIAquick PCR purification column. 
20 Vector and insert were ligated overnight in a 20 |il reaction at 16'C. The molar ratio of vector to insert was 1 :5. To 

prepare the DNA for electroporatlon, the ligation was desalted using QIAquick Nucleotide Removal columns. 

Example 8. Transformation of E. co//XL-1 Blue cells 

25 40 ^1 electrocompetent E. co//XL- 1 Blue cells were transformed with 2 ^1 desalted ligation reaction. Cells were 

recovered in 1ml SOC medium after electroporation. 50 |il of a 1:100 dilution were plated onto an LB plate containing 
ampicillin, IPTG and x-gal and incubated at 37*C overnight. 11 white and 1 blue colony were recovered. The 11 white 
colonies were grown up and the plasmids were isolated from the clones. Sequencing of the plasmids showed that 6 
of imG 11 plascTiiuS coniained in© correct spp insert. The resulting plasmid was denoted pMalc2::spp, and the fusion 

30 protein product MBP-SPP. 

Example 9 Growth of cells and fusion protein induction in £ co// XL-1 Blue pMalc2::spp 

A 1 liter culture of E. coli XL- 1 Blue pMalc2::spp was grown at 37'*C. 220 rpm and expression of the MBP-SPP 
3S fusion induced by adding 1mM IPTG at an A^o of 0.6. Induction was continued for 2 hours. The A^^o was monitored 
over that time period to ensure that cells were still growing during induction. 1ml samples were removed from the 
culture after 0.5, 1.0 1.5 and 2 hours, the cells pelleted and the pellet resuspended in 100 |.il SDS gel loading buffer. 
3 ut of the samples were analysed by SDS-PAGE with Coomassie staining. 

The cells were not effected by the overexpression of the fusion protein, they kept growing and after 2 hours of 
40 I PTG induction reached an A®QO of 1 .43. 

Protein extracts from samples were separated by SDS-PAGE and transferred onto a nitro<ellulose membrane by 
Western transfer. The MBP-SPP fusion protein could be immuno-detected by incubation of the membrane with anti- 
MBP rabbit serum and HRP-coupled anti-rabbit antibodies followed by ECL detection. The Western Blot revealed that 
2 proteins were induced upon IPTG addition. The major protein was of the expected size of approximately 56kD. the 
4S second protein, produced in lower amounts, was 10-15 kD larger. Analysis of the spp sequence revealed an accumu- 
lation of AUA (ile) and AGA (arg) codons that are rare in E. colL The nomaal frequency of AGA in E. coli is 0.27%, of 
AUA 0.51%. The spp sequence consists to 4.3% of AGA and 5. 1 % of AUA. These codons may cause frame shifts and 
other altered translation events when overexpressing a protein with a high abundance of rare codons. 

Another E. coli strain was available that constituively expresses the ArgU tRNA (tRNA^^*-*) and IleX tRNA 
50 (tRNA^AU) from the plasmid pRI952. 



Example 10. Transformation of E, co// W31 10 pRi952 

50 M.I electrocompetent E. co// W3110 pRI952 cells were transformed with 2 \i\ desalted ligation reaction. Cells 
55 were recovered in 1ml SOC medium after electroporation. 100 |il of the culture were plated onto selective LB plates 
(ampillin. cam) containing IPTG and x-gal and incubated at 37'C overnight. Too many colonies grew on the plate and 
could not be counted. 5 white colonies were selected and their plasmids isolated. Sequencing of the plasmids showed 
that all contained the correct spp insert. 
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Example 11. Growth of cells and fusion prot in Induction in £ co// W3110 pRI952 pMalc2::£pp 

The cells were grown and the expression of the fusion protein was induced as described above. The cells kept 
growing during the induction period from an A^oo of 0.6 to 1.45. Sanriples were taken from the culture at 0.0, 0.5, 1.5 
5 and 2.5 hours after tPTG addition and protein extracts analysed by SDS-PAGE with Coomassie staining, as well as 
Western transfer with Immunodection. Overexpression of the rare tRNAs IleX and ArgU resulted in only one MBP-SPP 
fusion protein being overexpressed during IPTG induction as detected by the anti-MBP serum. 

Example 12. Purification of the MBP-SPP fusion by affinity chromatography 

10 

Following optimal induction, the cells (E. co// W3110 pR1952 pMalc2::spp) were harvested and disrupted by son- 
ication after a freeze-thaw cycle to weaken the cell wall using known methods. The cell debris was removed by cen- 
trif ugation and the protein concentration of the crude extract determined using the BtO-RAD Protein Assay. The total 
amount of protein in the crude extract was 86 mg. A sample of the crude extract was analysed by SDS-PAGE with 

IS Coomassie staining which showed that the fusion protein was released from the ceils during sonication. 

The crude extract was then pumped through the amylose resin column, to whch the MBP-SPP fusion bound, and 
unbound protein was removed from the column by extensive washing with column buffer. The fusion protein was then 
eluted with column buffer supplemented with 1 0mM maltose using methods described herein. The eluate was collected 
as 3 ml fractions. Samples were removed during the purification procedure and analyzed by SDS-PAGE with Coomas- 

20 sie staining. Almost all of the fusion protein bound to the amylose resin, only trace amounts were found in the flow- 
through and no fusion protein was washed off the column. Fractions 4-11 contained reasonable announts of protein 
and were pooled, yiekjing in 25 ml protein solution with a concentration of 1 mg/mL 

Example 13. Factor Xa cleavage of the fusion protein and separation of MBP and SPP 

25 

500 |ig Factor Xa protease were added to the solution and the fusion protein was cleaved overnight at 4**C. A 
sample was analysed by SDS-PAGE before and after Factor Xa incubation to monitor complete cleavage. After 15 
hours almost all the fusion protein was cleaved resulting in the 42 kD MBP and SPP of approximately 14kD. 

The 3P proieiii readily precipiiaied upon Factor Xa cleavage and could be separated trom MBP by centhfugation. 

30 The SPP pellet was washed 3 times and then resuspended in 5ml column buffer. The MBP remained in solution and 
was found in the supernatant whereas the pellet consisted mainly of the SP protein and one minor contaminating 
protein, resulting in a reasonably pure preparation of SPP The total SPP yield was 2.18 mg/liter culture. The SPP 
preparation was diluted with column buffer to a final concentration of 0.32 mg/ml. 1mM DTT was added to prevent 
intermolecular disulfide bond formation between cystein residues. The addition of 7M urea denatured and apparently 

35 completely resolubilized the protein. The protein was aliquoted, snap-frozen and stored at -BO^C. 

Example 14. In vitro transcription of the SP RNA 

SP RNA was transcribed in vitroUom the plasmid pSPR using T7 RNA polymerase. pSPR is a pUCI 9 derivative 
40 that harbours the S. aureus sprgene behind a T7 promoter. The plasmid was linearized with BamHI restriction endo- 
nuclease to enable run off transcription. The Ambion MEG Ascript™ T7 kit was used for large scale transcription of the 
SP RNA. Thetranscriptbn reaction was performed at30*C to allow slow folding of the RNA into the correct confonmalion 
during synthesis. The RNA was cleaned under non-denaturing conditions using CLONTECH Chroma-Spin 30 columns 
and the RNA recoverd in 40 \i\ water. The quality of the transcript was monitored by gel electrophoresis on a 5% 
45 polyacrylamide TBE gel with 7M urea. The RNA was visualized by UV-shadowing over a TLC plate. 

In wfrotranscription of SP RNA from pSPR resulted in a single RNA product of the expected size of 401 nucleotides. 
The total RNA yield was 140 {determined spectrophotcmetrically). After cleaning, the RNA was recovered in 40 ul 
water resulting in a 29 |iM solution, that was stored at -20°C. 

50 Example 15. In v/fro transcription of £ coll pXHUA^^^ incorporating [a-32p] UTP 

E CO// ptRNA"iet a substrate for RNase R was transcribed in vitro by T7 RNA polymerase (Epicenter) in the 
presence of [a-32P] UTP to internally label the RNA. The DNA template pGem3Z-ptRNA was linearized with BstNl 
restriction endonuclease to allow run-off transcription. The transcription was performed in a 20 j^l reaction at 37''C for 
55 60min. To remove unincorporated nucleotides the 93nt RNA (sequence shown below) was cleaned under non-dena- 
turing conditions using a CLONTECH Chroma-Spin 10 column. To calculate the specific activity of the RNA probe, lul 
samples were removed before and after cleaning, mixed with 5 ml scintillation cocktail and counted in a scintillation 
counter. 
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ptRNA"'®* in pGem-3Z, sequence of the transcript (93mer) 5'-^ 3' [SEQ ID NO; 12]: 

GGGCG AAUUC GCCUC GGCUA CGUAG CUCAG UUGGU UAGAG CACAU CACUC 
AUAAU GAUGG GGUCA CAGGU UCGAA UCCCG UCGUA GCCAC CAG 

Transcription of ptRNA"™®* resulted in a 2|j.M (6 ng/|il) solution. 64.3% of the nucleotides were incorporated in the 
transcript and the specific activity was 2.171 10^ cpnViag. 

Example 16. 5' end-labeling of p6AT-1 

The chemicalty synthesized 42nt RNA molecule p6AT-1 was 5' end-labeled with high specific activity [y-^^P] ATP 
(6000^Ci/^i) using bacteriophage T4. polynucleotide kinase. The labeling was performed in a 10 |il reaction at 37"C 
for 30 min. The enzyme was heat-inactivated at 95°C for 2 min. To rennove salts, unincorporated nucleotides and the 
enzyme: p6AT-1 was get purified from a 20% polyacrylamide TBE gel with 7M urea. The RNA was extracted from the 
gel slice and precipitated with SOOmlVI NaOAc and 2.5 volumes 100% ethanol. After a spin at 14,G00g and 4**C the 
RNA pellet was washed twice with chilled 70% ethanof. The pellet was air dried and subsequently resuspended in 50 
)al water 0.5 ^il were spotted onto a filter paper disk and counted in a scintillation counter (Cherenkov analysis). 

The end-labeling resulted in a p6AT-1 preparation with an activity of 640,000 cpm/)jl. 

Example 17. Determination of the optimal buffer conditions for SP RNA reactions 

E. coti Ml RNA is able to cleave ptRNA"^®* or the minimal substrate p6AT-1 in lOOmM NH4CI. lOOmfVl TrisCI pH 
7.5, lOOmM MgCl2. The conditions for cleavage of either substrate by S. aureus SP RNA were unknown and clearly 
deviated from those optimized for E. colL A large number of different buffer conditions were investigated in order to 
determine optimal buffer conditions for either substrate cleavage by the S. aureus ribozyme. 

Cleavage reactions (20 pi) were all performed at 37'*C for 30 min with lOOnM SP RNA and an additional 200nM 
SP prctein in *ns hoioenzyrne reaciions. Substrates (ptRNAnfe^or p6AT-l ) were added in trace amounts (single-turnover 
reactions). Cleavage was monitored by size resolution of the intact precursor and the cleaved leader sequence by 
denaturing gel elctrophoresis and autoradiography or visualization on the phosphorimager (Molecular Dynamics Storm 
860). 

Example 18 Effect of KOI versus NIH4CI on p6AT-1 cleavage 

All experiments were performed in lOOmM TrisCI pH7.5, lOOmM MgC12 and 1M or 2M monovalent salt (KCI or 
NH4CI). 

There was hardly any cleavage product detectable at 1 M KCI or NH4C1. Cleavage occurred when 2M monovalent 
salt were used. Potassium chloride gave better results than ammonium chloride. 

Further optimization of the monovalent salt concentration was investigated following optimization of additional 
parameters described in the following sections. 

Example 19. Effect of the pH on p6AT-1 cleavage 

Buffers containing lOOmM MgC12, 2M KCI and lOOmM TrisCI at a pH of 7.0, 7.5 and 8.0 were tested for the ability 
to promote p6AT-1 cleavage by SP RNA. 

The substrate was cleaved in all three buffers but cleavage was very poor at pH 7.0 and optimal at pH 8.0. 

Example 20. Influence of PEG 8,000 on p6AT-1 cleavage 

rhe additiorTof "polyethylene glycoi"has"been shown to improve sui5slrate^Ieavage by E.~co//*MrRNA'rHererth~e 
effect of different PEG 8,000 concentrations on p6AT-1 cleavage by SP RNA was demonstrated. 1%, 2.5% and 5% 
PEG were added to reactions containing 1M KCI, lOOmM Tris CI pH8, lOOmM MgC12. 

The ability to cleave the substrate improved with increasing PEG concentrations, and therefore 5% PEG was 
selected to be included in subsequent reactions. 
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Example 21. MgCl2 requir ments for p6AT-1 proc sing 

The MgC12 concentration was successively reduced trom lOOmM to lOmM to determine the lowest magnesium 
ion concentration at which the SP RNA could still process the substrate. lOOmM. 50mM, 25mM and 10mM MgC12 
5 were tested in a buffer containing 2M KCI, lOOmM Tris CI pH8 and 5% PEG. 

Cleavage results were obtained at lOOmM MgC12, However SP RNA was still able to process p6AT-1 at 50mM 
and to some extend at 25mM MgCl2. No cleavage could be observed at lOmM MgC12. 

Example 22. Determination of KCt requirements for p6AT-1 cleavage 

10 

The KCI concentration was successively increased from 40mM to 1 .SM in lOOmM TriscI pHB, lOOmM MgC12 and 
5% PEG. 

No cleavage occurred at KCI concentrations below 400mM. At 400mM KCI p6AT-1 processing was still very poor 
but increased with the salt concentration to reach a maximal level of 27% at 1 .5M KCI. That was comparable to p6AT- 
75 1 cleavage by E. coli Ml RNA. where 35% of the substrate was cleaved after 20min. The best buffer conditions for 
p6AT-1 cieage by SP RNA were 1.5M KCI, lOOmM TrisCI pH8, lOOmM MgCI2, 5% PEG. 

Example 23. KCI requirements for ptRNA*"®^ cleavage by SP RNA 

20 ptRNA"'^' was cleaved under the same buffer conditions as described above. ptRNA"^* was a better substrate 

than p6AT-l . By contrast to p6AT-1 , ptRNAmet could be processed at KCI concentrations below 400mf^. Some cleav- 
age could already be detected at 20mM KCI, whereas no cleavage of p6AT-1 was obsen^ed under these conditions. 
The cleavage rate increased with the KCI concentration. At 150mM KCI more than 50% of the substrate was processed, 
reaching S2% at SOOmM KCI. At concentrations of 1 M or above 85% of the ptRNA were cleaved, and no further increase 

25 could be detected, indicating that 1 5% of the substrate was uncleavable. E coli Ml RNA as a positive control cleaved 
SO%ofthe substrate in lOOmf^ Tris.CI pH7.5, 100mMNH4CI. lOOmM MgCI 2 within 20m in. The best buffer conditions 
for ptRNA^«» cleavage by SP RNA were 1M KCI, lOOmM TrisCI pH8, lOOmM MgC12, 5% PEG. 

Eyampje 24. Determination of the cptirrss! buffer ccrsditicns for S. sursusni^ase P MOioenzyme feaciions: p6AT- 
30 1 cleavage by the holoenzyme 

The S. aureus RNase P holoenzyme cleavage of p6AT-l was investigated over a range of different KCI concen- 
trations at low magnesium ion concentrations. All reactions were performed in lOOmM Tris CI pHB, lOmM MgCI2, 5% 
PEG and 40mM to 1 .5M KCI. Cleavage could already be observed at 40mM KCI. The cleavage rate increased with 

35 the concentration of monovalent salt, peaking at 150mM, where 50% of the substrate was processed. At higher KCI 
concentrations the cleavage rate decreased again and p6AT-1 processing did not occur at KCI concentrations of 600mM 
and above. The E. cc// holoenzyme control cleaved 57% of the p6AT-1 within 20min in lOOmM Tris CI pH7.5, lOOmM 
NH4CI, lOmM MgCI2. The optimal buffer for p6AT-1 cleavage by the hobenzyme was 150mM KCI, lOOmM Tris.CI pH 
6.0, lOmM MgClg. 5% PEG 8000. p6AT-1 wa s a better substrate for the holoenzyme than for SP RNA, as under optimal 

^0 conditions only 27% p6 AT-1 was cleaved as opposed to 50% by the S. aureus RNase P holoenzyme. 

Example 25. MgClj requirements for ptRNA cleavage by the holoenzyme 

The S. aureus RNase P holoenzyme failed to cleave ptRNA'"®* under the same conditions employed for p6AT-1 
^5 processing. The holoenzyme was not able to cleave the substrate at lOmM MgCl2 under any given KCI concentration. 
The concentration of magnesium ions was increased to 20mM and ptRNA cleavage by the holoenzyme as well as the 
SP RNA ribozyme were tested at different KCI concentrations. The buffers contained 1 0OmM Tris CI pHS, 20mM MgCl2, 
5% PEG and lOmM. 50mM, 150mfy/1. SOOmM or 1.5M KCI. 

Cleavage of ptRNA by the holoenzyme at lOmM MgCl2 and low KCI concentrations was very poor. At 20mM 
50 MgCl 2 and 1 50mM KCI the holoenzyme processed ptRNA very well, 54% of the substrate was cleaved within a 20min 

incubation. Only a low percenta g e of substrate processing could be detected at SOOmM KCI . but good cleavage oc- 

curred at 1 .5M KCI. The SP RNA alone was also able to cleave ptRNA under these conditions but was unable to cleave 
ptRNA at lower salt concentrations. Therefore the optimal MgCl2 concentration for ptRNA'^* cleavage by the holoen- 
zyme was 20m M. 

55 

Example 26 Determination of the optimal KCI concentration 

The S. aureus RNase P holoenzyme was tested for its ability to cleave ptRNA""®' under different KCI concentrations 
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ranging from 20mM to 1 .5M. The buffers contained lOOmM Tris CI pH8, 20mM MgCIa, 5% PEG 8000 and xM KCI. 

The holoenzyme was able to cleave ptRN Amet at KCI concentrations as low as 20mM. The cleavage rate increased 
with the salt concentration with a peak at 150mM where 55% of the substrate was processed. The cleavage rate 
decreased at KCI concentrations above 200mM but improved again at 1M KCI reaching a maximum at 1.5M. The 
5 cleavage at high salt concentrations was not due to holoenzyme activity but result of SP RNA ribozyme activity at high 
KCI concentrations. 

ptRNA""®* was a better substrate lor SP RNA than for the S. aureus RNase P holoenzyme. The optimal buffer 
conditions for ptRNAmet cleavage by the holoenzyme were 150mM KCI,100mf\/l Tris CI pH8, 20mM MgCtg, 5% PEG. 

10 Example 26 Strain selection, Library Production and Sequencing 

The polynucleotide having the DNA sequence given in SEQ ID NO: 1 was obtained from a library of clones of 
chromosomal DNA of Staphylococcus aureus in E. colL The sequencing data from two or more clones containing 
overlapping Staphylococcus aureus DNAs was used to construct the contiguous DNA sequence in SEQ ID NO: 1. 
Libraries may be prepared by routine methods, for example: 

Methods 1 and 2 beiow. 

Total cellular DNA is isolated from Staphylococcus aureus WCUH 29 according to standard procedures and size- 
20 fractionated by either of two methods. 

Method 1 

Total cellular DNA is mechanically sheared by passage through a needle in order to size-fractionate according to 
25 standard procedures. DNA fragments of up to 1 Ikbp in size are rendered blunt by treatment with exonuclease and 
DNA polymerase^ and EcoRI linkers added. Fragments are ligated into the vector Lambda Zapit that has been cut with 
EcoRI, the library packaged by standard procedures and Eco// infected with the packaged library. The library is am- 
plified by standard procedures. 

30 Method 2 

Total cellular DNA is partially hydrotyzed with a one or a combination of restriction enzymes appropriate to generate 
a series of fragments for cloning into library vectors (e.g.. Rsal. Pall. AluL Bsh12351). and such fragments are size- 
fractionated according to standard procedures. EcoRI linkers are ligated to the DNA and the fragments then ligated 
35 into the vector Lambda Zapll that have been cut with EcoRI. the library packaged by standard procedures, and Ecoli 
infected with the packaged library. The library 
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SEQaENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Guth, Sabine 

Jennings, Joanne 
Prescotr, Catherine 

{ii) TITLE Or THE INVE^7^I0N: NOVEL RNASE P 

(iii) NUMEROF SEQUENCES: 14 



(iv) O0RRESPOM)ENCE ADDRESS: 

(A) ADDRESSEE: SmithKiine Beecham Corporation 

25 ( B) STREET: 709 Swedeland Poad 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 



(V) COMPUTER READA^E FORM: 

35 

(A) MEDIUM TYPE; Diskette 
( B) COMPUTER: I Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 
4S (A) APPLICATION NUMBER 

( B) FILING DATE: 06-JUN-1997 
(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 



(A) APPLICATION NUM Eft 60/019, 234 
( B) FILING DATE: 06-JUN-1996 
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(A) APPLICATION NUME:?: 60/029, 928 
( B) TILING DATE: 01-NOV-i996 

5 

(A) APPLICATION NUMEP: 60/029,928 
( B) FILING DATE: Ol-NOV-1996 

10 

ATTORNEY/ AGENT INFORMATION: 
NAME: Gimmi, Edward P. 
?EGISTPAXION NUM3:ft 38, 891 
?EFE HENCE /DOCKET MUMEft P50493 

20 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: €10-270-4478 
{ B) TELEFAX: 610-270-5090 
25 (C) TELEX: 

(2) INFOPMATION FOR SEQ ID NO: 1: 

ii) SiiwuENCE CHARACTERISTICS; 
(A) LENGTH: 35 4 base pairs 
( B) TYPE: nucleic acid 

(C) ST RANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

"~ ATGTTATTGG AAAAAGCTTA CCGAATTAAA AAGAATGCAG ATTTTCAGAG AATATATAAA 6 0 

AAAGGTCAT? CTGTAGCCAA CAGACAATTT GTTGTATACA CTTGTAATAA TAAAGAAATA 120 

GACCATTTTC GCTTAGGTAT TAGTGTTT CT AAAAAACTAG GTAATGCAGT GTTAAGAAAC 18 0 

50 AAGATTAAAA GAGCAATAO; TGAAAATTTC AAAGTACATA AGTCGCATAT ATTGGCCAAA 24 0 

G"A?7vrTATTG^"raT^^^ ^DO" 

AGTCTTGAGC ACGTACTTAA AATTGCCAAA GTTTTTAATA AAAAGATTAA GTAA 35 4 

55 

(2) INFORMATION FOR SEQ ID NO: 2: 



( viii} 
(A) 
{ B) 
(CI 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 117 amino acids 
( B) TYPE: amino acid 

(C) STRANCEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Leu Glu Lys Val Tyr Arg lie Lys Lys Asn Ala Asp Phe Gly 

15 10 15 

Arg lie Tyr Lys Lys Gly His Ser Val Ala Asn Arg Gin Phe Val Val 

20 25 30 

Tyr Thr C/s Asn Asn Lys Glu He Asp His Phe Arg Leu Gly He Ser 

35 40 45 

Val Ser Lys Lys Leu Gly Asn Ala Val Leu Arg Asn Lys He Lys Arg 

50 55 60 

Ala lie Arg Glu Asn Pne Lys Vai His Lys Ser His lie Leu Piia Lys. 
65 70 75 80 

Asp He He Val He Ala Arg Gin Pro Ala Lys Asp Met Thr Thr Leu 

85 90 95 

Gin lie Gin Asn Ser Leu Glu His Val Leu Lys He Ala Lys Val Phe 

100 105 110 

Asn Lys Lys He Lys 
115 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 401 base pairs 
( B) TYPE: nucleic acid 
CC ) ST"RAM)*EDTIE'SS: sxngle 
(D) TOPOLOGY: linear 

lii) MOLECULE TYPE: Genomic DMA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



10 



15 



GTTCTGATAT 
AGATGCATTG 
ATGAAATATC 
GCTTTTATAG 
TGGTAGGAGC 
AGTGGTGTAC 
ACGATGGAAC 



TTCGGGTAAT 
TAGTGTT CGT 
CTAAGTCTTT 

ACITGTTTAA 
GACAGATGGT 
AGAACATGGC 



CGCTATATTA 
GCTTGATGAA 
CGATATGGAT 
GGTGGAACGC 
CGGAATT CI^A 
TATCACCTGA 
TTATAGAAAT 



TATAGAGGAA 
ACAATAAATC 
AGAGTAATTT 
GGTAAACCCC 
CGTATAAACG 
GTACCAGTGT 
ATCACTACTA 



AGTCCATGCr 
AAGGCATTAA 
GAAAGTGCCA 
TCGAGTGAGC 
AGACACACTT 
GACTAGTGCA 
G 



CCACAGT CTG 
TTTGACGGCA 
CAGTGACGTA 
AATCCAAATT 
CGCGAAATGA 
CGTGATGAGT 



60 
120 
180 
240 
300 
360 
401 
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25 



{21 INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 2 base pairs 
( E5 TYPE: nucleic acid 
( C) Sr RANDEDNESS: single 
(D) TOPOLOGY: linear 



30 



{-.i) MOLECULE TYPE: RNA 



35 



[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GAAUUCGCCC GGACUCGGUU CAGUUCCGAG UCCGGGCACC AC 



42 



40 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
( ^ TYPE: nucleic acid 
( C) STRANDEDNESS: single 
(D; TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic D'NA 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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CGCGAAGTGT GTCTCGTTTA TACG 

(2) INFORMATION FOR SEQ ID HO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
( E) TYPE: nucleic acid 

(C) ST RANDEDNESa single 

(D) TOPOLOGY: linear 

{ii] MOLECULE TYPE: Genocnic D NA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAGGAAAGTC CATGCTC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 24 base pairs ■ 
{ ^ TYPE: nucleic acid 
{ C) STRANDEDNESS; single 
(D) TOPOLOGY: linear 

(ii) ysOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGTATTGCTC TTTTAATCTT GTTT 

(2) INFORMATION FOR SEQ ID NO: 8: 
50 (i) SEQUENCE CHARACTERISTICS: 



25 



30 



40 



45 



24 



17 



24 
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(A) LENGTH: 85 base pairs 

( B) TYPE: nucleic acid 

(C) STBAICEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) tOLcCULE TYPE: Genomic DMA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GTTTTCATTC CCTACCCTAT TTGCi^AACAG ACTCCATTCT GAAGCCCTAC GCAACTCCCC 60 
AATTACAAAA GTAAGGGATG GGATA S5 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHAPACTE RI STICS: 
(A) LENGTH: 213 base pairs 
{ B) TYPE: nucleic acid 
( C) ST RANDEDNESS: single 
(D) TOPOLOGY: linear 



15 



20 



2^ (ii) MOLECULE TYPE: Genome DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



30 



35 



40 



45 



50 



GTTTTCATTC CCTACCCTAT CCTTACTTAA TCTTTTTATT AAAAAGAATG CAGATTTTCA 6 0 

GAGAATATAT AAAAAAGGTC ATTCTGTAGC CAACAGACAA TTTGTTGTAT ACACTTGTAA 120 

TAA7AAAGAA ATAGACCATT TTCGCTTAGG TATTAGTGTT TCTAAAAAAC TAGGTAATGC 180 

AGTGTTAAGA ACAAGATTAA AAGAGCAATA CGT 213 

(2) INFOBMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHABACTERI ST ICS: 
(A) LENGTH: 215 base pairs 

( 3) TYPE: nucleic acid 
(C) ST RANDEDNESS: single 
(Dl TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 10: 
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ACGATAAATA ACAGTATGTT ATTGGAAAAA GT CT ACCGAA TTAAAAAGAA TGCAGATTTT 60 

CAGAGAATAT ATAAAAAAGG TCATTCTGTA GCCAACAGAC AATTTGTTGT ATACACTTGT 120 

^ AATAATAAAG AAATAGACCA TTTTCGCTTA GGTATTAGTG TTTCTAAAAJV ACTAGGTAAT 180 

GCAGTGTTAA GAACAAGATT AAAAGAGCAA TACGT 215 

10 (2) INFORMATION FOR SEQ ID NOt 11: 

(i) SEQUENCE OiARACTE RI STI CS; 
(A) LENGTH: 17 0 base pairs 

( E) TYPE: nucleic acid 

(C) ST HANDEDNESS: single 

( D ) TOPOLOGY: li nea r 

20 

(ii) MOLECULE TYPE: Genomic DNA 

2s (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 1: 

GAAAATTTCA AAGTACATAA GTCGCATATA TTGGCCAAAG ATTGTAATAG CAAGACAGCC 60 

AGCTAAAGAT ATGACGACTT TACaAATACA GAATAGTCTT GAGCAOGTAC TTAAAATTGC 120 

30 

CAA.IVGTTTTT AATAAAAA.GA TTAAGTAAGG ATAGGGTAGG GAATGAAAAC 170 
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(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERI ST ICS: 
(A) LENGTH: 93 base pairs 

( E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



55 



"GGG'a5AAUUC~G'Caj'CGG'CUA"CGUAG"CUCAG"UDGGUUAGAG"-CACAUCA-aJC~AU-A^^ 60- 
GGUCACAGGU UCGAAUCCCG UCGUAGCCAC CAG 53 

(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
( E} TYPE: nucleic acid 

(C) ST HANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
YGATATTT CK GRTAAYC 

(2) INEOBMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACT ERE ST ICS' 
2s (A) LENGTH: 137 base pairs 

C B) TYPE: nucleic acid 
(C) 3TRANDEDNESS; single 
TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTIOl* SEQ ID NO: 14: 

TGNAGACAGA TGGTTATCAC CTGAGTACCA GTGTGACTAG TGCACGTGAT GAGTACGATG 60 
GAACAGAACA TGGCTTATAG AAATAPCACT ACTAGTTTAG CTCTCCrAGA TGATGGAGAG 120 
CTTTTTTCAT GAAAAGA ^^"^ 
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Claims 

1. An isolated polynucleotide comprising a potynucleotide sequence selected from the group consisting of: 

50 (a) a potynucleotide having at least a 50% identity to a polynucleotide encoding a polypeptide comprising the 
amino-aGid-sequenGe-of-SEQ-ID-NO;2^ 

(b) a polynucleotide having at least a 50% identity to a polynucleotide comprising the nucleic acid sequence 
of SEQ ID NO: 1, 3 or 4; 

(c) a polynucleotide having at least a 50% identity to a polynucleotide encoding the same mature polypeptide 
55 expressed by the RNaseP gene contained in the Staphylococcus aureus of the deposited strain; 

(d) a polynucleotide having at least a 50% identity to a polynucleotide comprising the same catalytic RNA 
expressed by the RNaseP RNA gene contained in the Staphylococcus aureus of the deposited strain: 

(e) a polynucleotide encoding a polypeptide comprising an amino acid sequence which is at least 70% identical 
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to the amino acid sequence o< SEQ ID NO:2; 

(f ) a pofynucleotide which is complementary to the polynucleotide of (a), (b). (c), (d) or (e); and 

(g) a polynucleotide comprising at least 1 5 sequential bases of the polynucleotide of (a), (b), (c), (d). (e) or (f). 

2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence set forth in SEQ ID N0:1. 

5. The polynucleotide of Claim 2 comprising nucleotide 1 to 354 set forth in SEQ ID NO:1. 

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising the amino acid sequence of SEQ ID NO:2. 

7. A vector comprising the polynucleotide of Claim 1 . 

8. A host cell comprising the vector of Claim 7. 

9. A process for producing a polypeptide comprising: expressing from the host cell of Claim 8 a polypeptide encoded 
by said DNA. 

10. A process for producing a RNaseP polypeptide or fragment comprising culturing a host of claim 8 under condftions 
sufficient for the production of said polypeptide or fragment. 

11. A polypeptide comprising an amino acid sequence which is at least 50% identical to the amino acid sequence of 
SEQ ID N0:2. 

12. A polypeptide comprising an. amino acid sequence as set forth in SEQ ID NO:2. 

13. An antibody against the polypeptide of claim 11 . 

14. An antagonist which inhibits the activity or expression of the polypeptide of claim 11. 

15. A method for the treatment of an individual in need of RNaseP polypeptide comprising: administering to the indi- 
vidual a therapeutically effective amount of the polypeptide of claim 11 . 

16. A method for the treatment of an individual having need to inhibit RNaseP polypeptide comprising: administering 
to the individual a therapeutically effective amount of the antagonist of Claim 14. 

17. A process lor diagnosing a disease related to expression or activity of the polypeptide of claim 11 in an individual 
comprising: 

(a) determining a nucleic acid sequence encoding said polypeptide, and/or 

(b) analyzing for the presence or amount of said polypeptide in a sample derived from the individual. 

18. A method for identifying compounds which interact with and inhibit or activate an activity of the polypeptide of claim 
11 comprising: 

contacting a composition comprising the polypeptide with the compound to be screened under conditions to 
permit interaction between the compound and the polypeptide to assess the interaction of a compound, such 

tnteraction-being-asse«;iated~wilh-a-second-eompGnenl-eapable-of-providing-a-deteGtable-signal-in-Tesponse 

to the interaction of the polypeptide with the compound; 

and determining whether the compound interacts with and activates or inhibits an activity of the polypeptide 
by detecting the presence or absence of a signal generated from the interaction of the compound with the 
polypeptide. 

19. A method for inducing an immunological response in a mammal which comprises inoculating the mammal with 
RNaseP polypeptide of claim 11, or a fragment or variant thereof, adequat to produce antibody and/or T cell 
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immune response to protect said animal from disease. 



20. A method of inducing immunological response in a mammal whicln comprises delivering a nucleic acid vector to 
direct expression of RNaseP polypeptide of claim 11 , or fragment or a variant thereof, for expressing said RNaseP 

5 polypeptide, or a fragment or a variant thereof in vh/o in order to induce an immunological response to produce 

antibody and/ or T cell immune response to protect said animal from disease. 

21. An isolated RNase P from Staphylococcus aureus. 

10 22. The RNase P according to Claim t having an RNA component of the sequence of SEQ ID NO: 1 

23. The RNase P according to Claim 1 comprising a protein component of the sequence of SEQ ID NO: 2. 

24. An isolated DNA encoding the components of Claim 23. 

IS 

25. A method for individual or co-expression of the DNA of Claim 24. 

26. A screen for the identification of inhibitors of RNase P comprising one or more of the RNase P components set 
forth in SEQ ID NO: 1 through 10. 

20 

27. An isolated RNA component of Staphylococcus aureus RNase P. 

28. An isolated protein component of Staphvlococcus aureus RNase P. 
25 29. An isolated DNA encoding the components of Claim 22. 

30. A polynucleotide selected from the group consisting of SEQ ID NO: 5 through 12. 
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Figure 2- 



10 20 30 40 50 60 70 

ECO MVKLAFPRELPLLTPSQFTFVFQQPQRAGTPQITILGRLNSLG HPRIGLTVAKKNVRRAHE 

Pmi MVKIJ^PRELRIJ.TPKHE1IFVFCM3PQRASSPEOTII^RQKE^ HPRIGLTIAKKNVKRAHE 

St^ MLPTENRLRRREDFATAVRRGRRAGRPLLVVHRLSGATDPHAPGSSAPPTRAGFVVS- KAVGGAVV 

Mi u MLPRDRRVRTPAEFRHLGRTGTRAGRRTVWSVATDPDQTRSTS PS APRPRAGFWS - KAVGNAVT 

BSU - -MSHLKKRNRLKKNEDFQKVFKHGTSVANRQFVL YTLDCPENDE LRVGLSVSKK- IGNAVM 

Sau - -rO- LEI:Ai?.IKKKA:)FC3IY-xKGHS vAHRO"."' -rraWKEIOH rRLGISVSKK- lgnavl 

80 90 100 no i^u 

ECO RNRJOCRLTRESFRL- - RQ HELP AMD FVWAKK- -GVADLDNRALSEALEKLWRRHCR- - LARGS - - - 
Pmi RMRIlCRLAREyFRL- -HQHQLPAMDFWLVRK- -GVAELDNKQLTEVLGKLWRRHCR- -LAQKS — - 

Shi RHQVXRRLRHLVCD--RLSALPPGSLW-VRALPGAGDADHAQLARDLDAALQR LLGGGTR-- 

Miu RHKVKimLRAVVAEQMRLPPLRDLPVLVQVRALPAAAEADyALLRREW 

BSU RlIRXXRLIRQAFLE-EKER-LKEKDyiILARK--PASQLTYEETKKSLQHLFRKSSL--YKKSSSK- 
Sau ROTIKIUIRENFKV-HKSHILA-KDIIVIARQ- -PAroMTTLQlQNSLEHVLKIAKV- -FN^ 



Escherichia cdi (Eco). Proteus mimbifis (Pmi), Streptomyces bikiniensis (SbO, 
Micrococcus luteus (Mlu), Badlius subtilis (Bsu), Stsphyiococcus aureus (Sau). 
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Figure 3. 

Percent Similarity: 78.788 Percent Identity: 78.030 

3 1 1 TGTAGATAGATGATTGCCGCCTGAGTACGAG GTGATGAG... CCGTTTG 356 

; I: I I I I I I I I 11 I I M M I li I I I I I II M III 
5 TGNAGACAGATGGTTATCACCTGAGTACCAGTGTGACTAGTGCACGTGAT 54 

357 CAGTACGATGGAACAAAACATGGCTTACAGAACGTTAGACCACTTACATT 406 

II! I !! I! II II II I II II I II II i I II I i I I I I I II 

55 GAGTACGATGGAACAGAACATGGCTTATAGAA ATATCACJACTAGT 100 

407 TAAAATGATGAAAACAAGCTCTCCCGTAT. AAGGAGAGCTTTTATCTTGA 455 

I 1 1 1 1 II 1 1 1 1 I I II 1 1 1 mil I I 1 1 1 

101 T TAGCTCTCCTAGATGATGGAGAGCTTmTCATGA 136 

456 AAAGA 460 

• I I I I I 
137 AAAGA 141 




44 



J 



EP0 811 688 A2 



Figure 4. 
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Figure 5. 
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Figure 6. 
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An excess of drug binding sites 
created by RNA fragments 
decre ases the probability of the 
drug ( H ) binding artd thus 
inhibiting the ribozyme. 
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Figure 7. 
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Figure 8. 
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Figure 9. 





10 cycles ^ 














«*- 



1 S. aureus genomic DNA 

frannriAnt /temolate) 



single-Stranded products 



50 



EP0 811 688 A2 



Figure 10. 
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